1 

TCG ACT ATG AAT GCT GAT ACT GCT CCA ACA 
ser thr met asn ala asp thr ala pro thr 

61 

TCA AGC TCC TGC TCC AGC TTC CAG GAC CAG 
ser ser ser cys ser ser phe gin asp gin 

121 

AGA GTA CCC GCC AGC AGC ACT TCC TCA CCG 
arg val pro ala ser ser thr ser ser pro 

181 

CTG GAT GCC GAA GGG GAA GGA ATC AGC GAA 

leu asp ala glu gly glu gly ile ser glu 

==D241 

AGC CTG CTA AGT TCT CAC GAC CTG GAC CCA 
ser leu leu ser ser his asp leu asp pro 

301 

ATC GCC GCC CTT TAC CTA CCT TTA GTT GGC 
■=^^ ile ala ala leu tyr leu pro leu val gly 

361 

GAC TTT ACA GTT GCA GAT ACT CGC AGA TAC 

n asp phe thr val ala asp thr arg arg tyr 

2 421 

GGA GCC GGT GCC ATT ACC CAG AAT GTG GCT 
gly ala gly ala ile thr gin asn val ala 

481 

AAA ACA AGT GGA ATA GTG CTG TCT TCC TTG 
lys thr ser gly ile val leu ser ser leu 

541 

GAC ACT ACT CGC AAC CTC ATG ATC TGC TTC 
asp thr thr arg asn leu met ile cys phe 

601 

CTC ATT AGG AAG TGG ATT GCT GAC CTG CCA 
leu ile arg lys trp ile ala asp leu pro 



31 

TCT CCT TGT CCT TCC ATA TCT TCC CAG AAC 
ser pro cys pro ser ile ser ser gin asn 

91 

AAG ATC GCC AGC ATG TTC GAT CGG ACT TCC 
lys ile ala ser met phe asp arg thr ser 

Cadherin 
151 I XX EC motif XX 1 

GGG CTC CTC TTC ACA GAA CTG GCT GCT GCC 
gly leu leu phe thr glu leu ala ala ala 

211 

GTA CAA AGG AAA GCT GTC AGT GCA ATT CAC 
val gin arg lys ala val ser ala ile his 

271 

CGC TGT GTC AAA CCA GAG GTG AAG GTC AAA 
arg cys val lys pro glu val lys val lys 

331 

ATC ATT TTG GAT GCT TTG CCA CAG CTC TGT 
ile ile leu asp ala leu pro gin leu cys 

391 

CGC ACC AGT GGC TCG GAT GAA GAA CAA GAA 

arg thr ser gly ser asp glu glu gin glu 
451 

CTG GCC ATA GCA GGG AAT AAT TTC AAT TTG 

leu ala ile ala gly asn asn phe asn leu 

511 

CCC TAT AAG CAG TAC AAC ATG CTG AAC GCG 
pro tyr lys gin tyr asn met leu asn ala 

571 

CTC TGG ATC ATG AAA AAT GCT GAT CAG AGC 
leu trp ile met lys asn ala asp gin ser 

631 

TCA ACG CAG CTC AAC AGG ATT TTA GAT CTA 

ser thr gin leu asn arg ile leu asp leu 
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661 691 

CTT TTC ATC TGT GTG TTA TGT TTT GAG TAT AAG GGA AAA CAG AGT TCT GAG AAA GTC AGT 
leu phe ile cys val leu cys phe glu tyr lys gly lys gin ser ser asp lys val ser 

721 751 

ACC CAA GTC CTG CAG AAG TCA AGG GAT GTC AAG GCC CGG CTG GAA GAG GCT TTG CTG CGT 
thr gin val leu gin lys ser arg asp val lys ala arg leu glu glu ala leu leu arg 

781 811 

GGG GAA GGG GCC AGA GGG GAG ATG ATG CGC CGC CGG GCT CCA GGG AAC GAC CGA TTT CCA 
gly glu gly ala arg gly glu met met arg arg arg ala pro gly asn asp arg phe pro 

841 871 

GGC CTA AAT GAA AAT TTG AGA TGG AAG AAA GAG CAG ACA CAT TGG CGG CAA GCT AAT GAG 
gly leu asn glu asn leu arg trp lys lys glu gin thr his trp arg gin ala asn glu 

901 931 

AAG CTA GAT AAA ACA AAG GCC GAG TTA GAT CAA GAA GCC TTG ATC AGT GGC AAT CTG GCT 
lys leu asp lys thr lys ala glu leu asp gin glu ala leu ile ser gly asn leu ala 



961 991 
i ACA GAA GCA CAT TTA ATC ATC CTG GAT ATG CAG GAA AAC ATT ATC CAG GCG AGC TCG GCT 
z thr glu ala his leu ile ile leu asp met gin glu asn ile ile gin ala ser ser ala 

:J 1021 1051 

I CTG GAC TGT AAA GAC AGC CTG CTG GGA GGT GTT CTG AGG GTG CTG GTG AAT TCT CTG AAC 
leu asp cys lys asp ser leu leu gly gly val leu arg val leu val asn ser leu asn 

^ 1081 1111 

TGT GAT CAG AGT ACC ACC TAC CTG ACT CAC TGC TTT GCA ACA CTC CGT GCT CTC ATC GCC 
Z cys asp gin ser thr thr tyr leu thr his cys phe ala thr leu arg ala leu ile ala 

i 1141 1171 

AAG- TTT GGA GAC TTA CTC TTC GAA GAG GAG GTG GAA CAG TGT TTC GAC CTA TGT CAC CAA 
lys phe gly asp leu leu phe glu glu glu val glu gin cys phe asp leu cys his gin 

1201 1231 

GTC CTG CAC CAC TGC AGC AGC AGC ATG GAT GTC ACC CGG AGC CAA GCC TGT GCC ACC CTT 
val leu his his cys ser ser ser met asp val thr arg ser gin ala cys ala thr leu 

1261 1291 

TAC CTC CTC ATG AGG TTC AGT TTT GGA GCC ACC AGT AAT TTT GCA AGA GTA AAG ATG CAA 
tyr leu leu met arg phe ser phe gly ala thr ser asn phe ala arg val lys met gin 

1321 1351 

GTA ACC ATG TCC CTG GCA TCT TTG GTG GGA AGA GCA CCA GAC TTT AAT GAA GAG CAC CTG 
val thr met ser leu ala ser leu val gly arg ala pro asp phe asn glu glu his leu 
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1381 1411 

AGA AGA TCC TTG AGG ACA ATT TTG GCC TAT TCA GAA GAG GAC ACA GCC ATG CAG ATG ACT 
arg arg ser leu arg thr ile leu ala tyr ser glu glu asp thr ala met gin met. thr 

1441 1471 

CCT TTT CCC ACC CAG GTG GAG GAA CTT CTC TGT AAT CTG AAT AGC ATC TTA TAT GAC ACA 
pro phe pro thr gin val glu glu leu leu cys asn leu asn ser ile leu tyr asp thr 

1501 1531 

GTG AAA ATG AGG GAA TTT CAG GAA GAT CCT GAG ATG CTT ATG GAT CTC ATG TAC AGA ATT 
val lys met arg glu phe gin glu asp pro glu loet leu met asp leu met tyr arg ile 

1561 1591 

GCC AAG AGT TAC CAG GCA TCT CCT GAT CTG CGG CTG ACC TGG CTC CAG AAC ATG GCA GAG 
ala lys ser tyr gin ala ser pro asp leu arg leu thr trp leu gin asn met ala glu 

1621 I xxxxxxxxxxxxxxxxxxxxxx transmembrane domain xxxxxxx 

AAA CAC ACC AAG AAG AAG TGC TAC ACG GAG GCT GCC ATG TGC CTG GTG CAC GCC GCT GCG 
lys his thr lys lys lys cys tyr thr glu ala ala met cys leu val his ala ala ala 

xxxxxxxxxxxxxxxxxxxxxxx I 1711 

TTA GTG GCT GAG TAT CTG AGC ATG CTG GAG GAC CAC AGC TAC CTG CCC GTG GGC AGT GTC 
leu val ala glu tyr leu ser met leu glu asp his ser tyr leu pro val gly ser val 

1741 1771 

AGC TTC CAG AAT ATT TCT TCC AAT GTG CTG GAG GAG TCT GTG GTC TCT GAG GAC ACC CTG 
ser phe gin asn ile ser ser asn val leu glu glu ser val val ser glu asp thr leu 

1801 1831 

TCA CCT GAC GAG GAT GGG GTG TGC GCA GGC CAG TAC TTC ACC GAG AGT GGC CTG GTA GGC 

ser pro asp glu asp gly val cys ala gly gin tyr phe thr glu ser gly leu val gly 

1861 1891 

CTC CTG GAG CAG GCC GCG GAG CTC TTC AGC ACG GGA GGC TTA TAT GAG ACA GTT AAT GAG 

leu leu glu gin ala ala glu leu phe ser thr gly gly leu tyr glu thr val asn glu 

1921 1951 

GTC TAC AAG CTG GTC ATC CCC ATC CTA GAA GCG CAT CGA GAA TTC CGG AAG CTG ACA CTC 
val tyr lys leu val ile pro ile leu glu ala his arg glu phe arg lys leu thr leu 

1981 2011 

ACT CAC AGC AAG CTG CAG AGA GCC TTC GAC AGC ATC GTT AAC AAG GAT CAT AAG AGA ATG 
thr his ser lys leu gin arg ala phe asp ser ile val asn lys asp his lys arg met 

2041 [xxxxx ITAM xxxxj 2071 

TTT GGA ACC TAC TTC CGA GTT GGT TTC TTT GGA TCC AAA TTT GGG GAT TTG GAT GAA CAG 

phe gly thr tyr phe arg val gly phe phe gly ser lys phe gly asp leu asp glu gin 
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2101 2131 

GAG TTT GTC TAG AAA GAG COT GCA ATT AGO AAG CTT CCT GAG ATC TCA CAT AGA CTA GAG 
glu phe val tyr lys glu pro ala ile thr lys leu pro glu ile ser his arg leu glu 

2161 2191 

GCA TTT TAT GGT CAA TGT TTT GGT GCA GAA TTT GTG GAA GTG ATT AAA GAC TCC ACT CCT 
ala phe tyr gly gin cys phe gly ala glu phe val glu val ile lys asp ser thr pro 

2221 2251 

GTG GAC AAA ACC AAG TTG GAT CCT AAC AAG GCC TAG ATA CAG ATC ACT TTT GTG GAG CCC 
val asp lys thr lys leu asp pro asn lys ala tyr ile gin ile thr phe val glu pro 

2281 2311 

TAG TTT GAT GAG TAT GAG ATG AAA GAC AGG GTC ACA TAC TTT' GAG AAG AAT TTC AAC CTC 
tyr phe asp glu tyr glu met lys asp arg val thr tyr phe glu lys asn phe asn leu 

2341 2371 

CGG AGG TTC ATG TAC ACC ACC CCG TTC ACC CTG GAG GGG CGG CCT CGG GGA GAG CTG CAT 

arg arg phe met tyr thr thr pro phe thr leu glu gly arg pro arg gly glu leu his 
2401 2431 

GAG CAG TAC AGA AGG AAC ACA GTC CTG ACC ACT ATG CAC GCC TTC CCC TAC ATC AAG ACC 

glu gin tyr arg arg asn thr val leu thr thr met his ala phe pro tyr ile lys thr 

2461 2491 Ixxxxxxxxxxxxxxxxxxxxxxx 

AGG ATC AGC GTC ATC CAG AAG GAG GAG TTT GTT TTG ACA CCG ATT GAA GTT GCC ATT GAA 
arg ile ser val ile gin lys glu glu phe val leu thr pro ile glu val ala ile glu 

xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx Coiled coil 1 xxxxxxxxxxxxxxxxxxxxxxxxxxxxxx 
GAC ATG AAG AAG AAG ACC CTG CAG TTA GCA GTT GCC ATT AAC CAG GAG CCG CCT GAT GCA 

asp met lys lys lys thr leu gin leu ala val ala ile asn gin glu pro pro asp ala 
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx I 2611 

AAG ATG CTT CAG ATG GTG CTG CAA GGC TCT GTG GGA GCT ACT GTA AAT CAG GGA CCA CTG 

lys met leu gin met val leu gin gly ser val gly ala thr val asn gin gly pro leu 

2641 2671 

GAA GTA GCC CAA GTG TTT TTG GCT GAA ATT CCT GCT GAT CCA AAA CTC TAT CGA CAT CAC 
glu val ala gin val phe leu ala glu ile pro ala asp pro lys leu tyr arg his his 

2701 2731 I xxxxxxxxxxx 

AAC AAG TTG AGG TTA TGC TTT AAG GAA TTC ATC ATG AGA TGT GGT GAA GCT GTA GAG AAA 
asn lys leu arg leu cys phe lys glu phe ile met arg cys gly glu ala val glu lys 

xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx Coiled coil 2 xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx 
AAC AAG CGT CTC ATC ACG GCA GAC CAG AGG GAA TAT CAG CAG GAA CTC AAA AAG AAC TAT 
asn lys arg leu ile thr ala asp gin arg glu tyr gin gin glu leu lys lys asn tyr 
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X XX XXXX X xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx 1 

AAC AAG CTA AAA GAG AAC CTC AGG CCA ATG ATC GAG CGG AAA ATT CCA GAA CTG TAC AAG 
asn lys leu lys glu asn leu arg pro met ile glu arg lys ile pro glu leu tyr lys 

2881 2911 

CCA ATA TTC AGA GTT GAG AGT CAA AAG AGG GAC TCC TTC CAC AGA TCT AGT TTC AGG AAA 
pro ile phe arg val glu ser gin lys arg asp ser phe his arg ser ser phe arg lys 

2941 2971 

TGT GAA ACC CAG TTG TCA CAG GGC AGC TAA GAA AAG CCA TCT TCA TTC GTG GAG ACT GTG 
cys glu thr gin leu ser gin gly ser OCH glu lys pro ser ser phe val glu thr val 

3001 3031 

GCC CTG CAA CCC TGG AGA AGG ACT TGC TGG TAC TTA AAA AAT GGG ACA TTT GCC ACC CAG 

ala leu gin pro trp arg arg thr cys trp tyr leu lys asn gly thr phe ala thr gin 
3061 3091 

GAC TGA CTG TAC ACT CCC TGA TCA GCC AGC ACT CTG GAA GCT TTG GGA TCC CAG GAA CCA 
asp STP 

3121 3151 

TGG AAT TAT TCC CAA ATG GAC TCT GAC CAG ATT TTT GCC ATA CTG GGG GGT GGC GGG ATG 
3181 3211 

GAG GAT GGG TAC TCA GGC ATG ACT GCG TAT TTA TTA AAG TGT GTT TTT CCA CAA TGT ACC 
3241 3271 

AAA CAA GGC ATA AGC AGC TTC TCC TGC TGA CTG GCC AAT CAC TGC CCA TCT GAG AGA TGA 
3301 3331 

TTT OCT CTG GCC CAT ATT TGA ATT TAT TGG AGT AAC TCA AAT TGC CTG AGG AAA AAT GGA 
3361 3391 

AAA ATT ATC CAC CAG TCG ATT CAA ACT GAA TTT CAC TCT TTA TAG GAA GGC AGG GCA AAC 
3421 3451 

TTG TAG GAG TAC GAA ACA TTT TCA ATA AAT CTA CAA AGG GAA GCC TTA CTA CAA TTC CAA 
3481 3511 

AAA TCA TCA TGG TTG GAA ATT TGG GAG GAG ATT ATT TGT GAA CTT GTT ACC CTT TTG GTA 
3541 3571 

ATG GTG GAC TAA TTG CTG TAT AGT TAT TTT TGT TTT ATT ATT ACT GTT ACA TTA ATT TAA 
3601 3631 

CAT GCA TTT ATA GAA GAA TAC ATT CAA AGC ACT GAT GTA GGA GAT ACA CGG TAC TTG GAG 
3661 3691 

CAG TCA GCC AAA AAT CAC AGA TAC TGC TTT CAC TTA AAT GGA AAC AAT TCT CCG ATA ATG 
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3721 3751 

CTT TGC TTT TTT TCT TAT GTC ACT CTT GTG TAC TAT CTA TTT TTC TCC TCT CTG GGA CCA 
3781 3811 

AGT TTC TTT TTA TAA AGO AAT AAT ATC TCT GTT TTC ATT TCA GAA CAT TGT GCT GTC TGT 
3841 3871 

CAG CAT ATG TAT ATC AGC TAC AAA ATA TAT TCA ACT TTG ACT TCT TTT GAC AAA GGA CTT 
3901 3931 

TAG GAA AAG GAG GAA CAA AGA CAT TAT TTG AGA ATT AAA TTA TAT ATT TTT AAT ATG ACT 
3961 3991 

GTG ACC TTG ACT GAT AAT AAA GAT GTA ATA AGA ATT GCA AGC TAA AAA AAA AAA AAA AAA 

4021 
AAC TCG 
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HC2A 

KIAA ASGNLDKNARFSMYRQDSNKLSNDDMLKLLADFBKPEKMAKLPVILGN^^ 

rat 

HC4 

HCl 

HC3 

HC5 



HC2A 

KIAA FPNyVNSSyiPTKQraTCSKTPITFEVIIEFVPCIPKHTQPYTIYTNHLYVYPKYIJ<YBSO 

rat 

HC4 

HCl 

HC3 ■ 

HC5 



HC2A VLHHHQNPEFYDEIK 

KIAA KSFAKAiy^IAICIEFKDSDEEDSQPLKCIYGRPGGPVFTRSAFAAVLHHHQNPEFYDEIK 

rat 

KC4 

HCl 

HC3 

HC5 



HC2A lELPTQLHEKHHLLLTFFHVSCDNSSKGSTKKRDWETQVGYSWLPLLKDGRWTSEQHI 

KIAA lELPTQIx^KHHLLLTFFHVSCDNSSKGSTKKRDVVETQVGYSWLPIJ^^ 

rat 

HC4 

HCl 

HC3 

HC5 

HC2A PVSANLPSGYLGYQELGMGRHYGPEIKWVDGGKPLLKISTHLVSTVYTQDQHLHNFFQYC 

KIAA PVSANLPSGYLGYQELGMGRHYGPEIKWVDGGKPLLKISTHLVSTVYTQBQHLHNFFQYC 

rat 

HC4 

HCl 

HC3 GPGPARSTVSISIilSNSARV 

HC5 



HC2A QKTESGAQALGNELVKYLKSLHAMEGHVMIAFLPTILNQLFRVLT-RATQEEVAVNVTRV 

KIAA QKTESGAQALGNELVKYIJCSIJiAMEGHVMIAFliPTIIJ^QLFRViT-RATQEEV 

rat 

HC4 MEIQVLIRFLSVILMQLFWVLPNMIHEDDVPISCPMV 

HCl MS FLP 1 1 LNQL FKVLV-QNEEDE I TTTVTRV 

HC3 NRSR5LSNSNPDISGTPTSPDDEVRSIIGSKGLDRSNSWVNTGGPKA?VPWGSNPSPSAES 

HC5 



A 
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* HC2A * IIHWAQCHEEGLESHLRSyVKYAyKAEPYVASEYKTVHEELTKSMTTILKPSADFLTSN 

KIAA J lifWAQCHEEGIJISHIJlSyVKYAyKAEPYVASEYKTVHEELTKSMTTII^ 

rat 

HC4 LFHIVSKCflEEGLDSYLSSFIKYSFRPGKPSAPQAPLIHETIATMMIALLKQSADFIAIN 

HCl LPDIVAKCHEEQLDHSVQSYIKFVFKTR ACKERPVHEDLAKNVTGLLK-SNDSPTVK 

HC3 TQ^DRSCI^RMSSHTETSSFLQTLTGRLP TKKLFHEELALQ57WCSG — SVR E 

HC5 

Cadherin 

Cl eava ge 

HC2A KLLRYSWFFFDVLIKSMAQHLIENSKVKLl|EmOFjFPASYHfIAAETVV^ 

KIAA KLLKYSWFFFDVLIKSMAQHLIENSKVKLI RNQP FPAS YHHAVETWNMLMPHITQKFRD 

rat 

HC4 KLLKYSWFFFEI IAKS^5ATYLLEENKIKLI HGQP FPKAYHHALHSLFIAIT-IVESQYAE 

HCl HVLKHSWFFFAI ILKSHAQHLXDTNKIQLE RPQP FPESYQNELDNLVMVLSDHVIWKYKD 

HC3 SALQQAWFFFEIiWKSMVHHLYraDKLEM RKSPFPERFMDDIAALVSTIASDIVSRFQK 

HC5 



HC2A 

KIAA 

rat 

HC4 

HCl 

HC3 

HC5 



NPEASKNANHSLAVFIKRCFTfm>RGFVFKQIN NYIS—CFAPGDPKTLFEYKFEFL 

NPEASKNANHSLAVFIKRCFTFMDRGFVKKQIN NYIS— CFAPGDPKTLF^IYKFEFL 

IPKESRNVNYSLASFLKCCLTLMDRGFVFNLIN DYIS — GFSPKDPKVLAEYKFEFL 

ALEE TRRATHS VARFLKRC FT FMDRGCVFKMVN NYI S — MFSSGDLKTLCQYKFDFL 

DTEMVERLNTSLAFFLNDLLSVMDRGFVFSLIKSCYKQVSSKLYSLPNPSVLVSLRLDFL 



HC2A 

KIAA 

rat 

HC4 

HCl 

HC3 

HC5 



RWCNHEHYI PLNLPM- 
RWCNHEHYIPLNLPM- 



-PFGKGRIQR- 
-PFGKGRIQR- 



-YQDLQL DYSLTDEF 

-YQDLQL DYSLTDEF 



QTICNHEHYIPLNLPM AFAKPKLQR VQDSNL EYSLSDEY 

QEVCQHEHFIPLCLPIRSANIPDPLTPSES TQELHASDMPEYSVTNEF 

RIICSHEHYVTLNLPCSLLTPPASPSPSVSSATSQSSGFSTNVQDQKIANMFELS — VPF 
HNADTAPTSPCPSIS SQNSSSCSSFQDQKIASMFDRTSRVPA 



HC2A 

KIAA 

rat 

HC4 

HCl 

HC3 

HC5 



Cadherin 
EC motif 



CRNHFLVGL 
CRNHFLVGL 


LLRE 
LLRE 


VG TALQE FRE VRL I AI S VLKNLL I KHS FDDRYASRSHQAR I AT 

7GTALQEFRE VRLIAISVLKNLLIKHSFDDRYASRSHQARIAT 


CKHHFLVGL 
CRKHFLIGI 
RQQHYLAGI 
SSTS-SPGI 


LLRE 
LLRE 
VXTE 
LFTE 


TSIALQDNYE IRYTAISVIKNLLIKHAFDTRYQHKNQQAKIAQ 

VG FALQE DQD VRHLALAVLKNLMAKHS FDDRYRE PRKQAQI AS 

LAVILDPDAEGLFGLHKKVINMVHNLLSSHDSDPRYSDPQIKARVAM 
LAAALDAEGEGISEVQRKAVSAIHSLLSSHDLDPRCVKPEVKVKIAA 



HC2A LYLPLFGLLIENVQRINVRDVSPFPVNAG-MTVKDESLALPAVNPLVTPQKGSTLDNSLH 

KU^ LYLPLFGLLIENVOKINVRDVSPFPVNAG-MTVKDESLALPAVNPLVTPQKGSTI^^ 

rat 

HC4 LYLPFVGLLLENIQRLAGRDTLYSCAAMPNSASRDEFPCG FTSP — AN— RGSLS 

HCl LYMPLYGMLLDNMPRIYLKDLYPFTVNTSNQGSRDDLSTNGGFQSQTAIKHANSVDTSFS 

HC3 LYLPLIGIIMETVPQLYDFTETHNQRGRPICIATDDYESE SG SMIS 

HC5 LYLPLVGIILDALPQLCDFTVADTRRYR TSGSDEEQE GA GAIT 



A 
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HC2A KDLLGAISGIASPYTTSTPNINSVKKADSRGSLISTDSGNSLPERNSEKSNSLDKHQQSS 

KIAA • KDLLGAISGIASPYTTSTPKINSVRNADSKGSLISTDSGNSLPERNSEKSNSIiDKHQQSS 

rat 

HC4 TDKDTAYGSFQNG-^ HGIKMDSRGSLIP-EGATGFPDQGNTGEN TRQS 

HCl KDVLNSIAAFSS lAISTVNHADSRASLASLDSNPSTNEKSSEKTDNCEKIPRPL 

HC3 QTVAMAIAGT5VPQ LTRPGSFLLTSTSGRQHT 

HC5 ONYALAI AGNNFN LKTSG-IVLSSLPYKQYN 

HC2A TLGNSVVRCDKLDQSEIKSLIJyiCFLYILKSMSDDALFTYWN-KASTSELMDFFTISEVCL 

KIAA TLGNSVVRCI)KLI)QSEIKSLLMCFLYILKSMSDDALFTYWN-KASTSEliMDFFTISEVCL 

rat 

HC4 STRSSVSQYNRLDQYEIRSLLMCYLYIVKMISEDTLLTYWN-KVSPQELIKILILLEVCL 

HCl ALIGSTLRFDRLDQAETRSLLMCFLHIMKTISYETLIAYWQ-RAPSPEVSDFFSILDVCL 

HC3 T FSAES SRSLL I C LLWVLKN-ADETVLQKW FT DLS VLQLNRLLDLL YLCV 

HC5 MLNADTTRNmiCFLWIHKN-ADQSLIRKWIADLPSTQLNRILDLLFICV 

HC2A HQFQYMGKRYIARNQEGLG—PIVHDRKS QTLPVSRNRTGMM 

KIAA HQFQYMGKRYIAR TGMM 

rat 

HC4 FHFRYMGKRNIARVHDAWLSKHFGIBRKS QTMPALRNRSGVM 

HCl QNFRYLGKRNI IRKIAAAF — KFVQSTQNNGTLKGSNPSCQTSGLLAQP3MHSTSRHEGHK 

HC3 SCFEYKGKKVFERMNSLTFK—KSKDMRAK LEEAILGSIGARQEMV 

HC5 LCFEYKGKQSSDKVSTQVLQ — KSRDVKAR LEEALLRGEGARGEMM 

HC2A HARLQQL GSLDNS LTFNHSYGHSDADVLHQSLLEANIATEVC 

KXAA HARiQQL GSLDNS LTFNHSYGHSmDVLHQSLLEANIATEVC 

rat 

HC4 QARLQHL SSLESS FTLNHSSTTTEADIFHQALLEGNTATEVS 

HCl QHRSQTLPIIKGK NALSNPKL LC^LDNTMTSNSNEIDIVHHVDTEANIATEGC 

HC3 RRSRGQLERSPSGSAFGSQENLRWRKDMTHWRQNTEKLDKSRAEIEHEALIDGNLATEAN 

HC 5 RRRAPGNDRFP GLNENLRWKKE QTHWRQANEKLDKTKAELDQEAL I S GNLATEAH 

HC2A LTALDTLSLFTLAFKNQLLADHGHNPLMKKVFDVYLCFLQKHQSETALKNVFTALRSLIY 

KIAA LTALI5TLSLFTLAFKNQLIJU:)HGHNPlJ*iKKVFDVYLCFLQKHQSETAI^ 

rat KLSRGHSPLMKKVFDVYLCFLQKHQSEMALKNVFTALRSLIY 

HC4 LTVLDTISFFTQCFKTHFLNNDGHNPLMKKVFDIHLAFLKNGQSEVSLKHVFASLRAFIS 

HCl LT I LDLVSLFTQTHQRQLQQCDCQNSLMKRGFDT YMLFFQVNQSATALKHVFASLRLFVC 

HC 3 L 1 1 LDTLE I WQTVS - - VTES — KES I LGGVLKVLLHSMACNQS AVYLQHC FATQRALVS 

HC5 LIILDMQENIIQASS — ALDC — KDSLLGGVLRVLVNSLNCDQSTTYLTHCFATLRALIA 

HC2A KFPSTFYEGRADMCAALCYEILKCCNSKLSSIRTEASQLLYFLMRNNFDYTGKKSFVRTH 

KIAA KFPSTFYEGRADMCAALCYEIXJ^CGNSKLSSIRTEASQLLYFLMRNNFDYTGKKSFVRTH 

J^at KFPSTFYEGRADMCASLCYEVXKCCNSKI^SIRTEASQLLYFIJtdRNNFDYTGKKSFVRTH 

HC 4 KFPSAFFKGRVNMCAAFCYEVLKCCTSKISSTRNEASALLYLLMRNNFEYTKRKTFLRTH 

HC 1 KFPS AFFQGPADLCGS FC YEVXKCCNHRSRS TQTEASALLYLFMRKNFE FNKQKS I VRSH 

HC3 KFPELLFEEETEQCADLCLRLLRHCSSSIGTIRSHPSASLYLLMRQNFEIGN—NFARVK 

HC5 KFGDLLFEEEVEQCFDLCHQVLHHCSSSMDVTRSQACATLYLLMRFSFGATS — NFARVK 

HC2A LQVIISVSQLIADWGIGETRFQQSLSIINNCANSDRLIKHTSFSSDVKDLTKRIRTVLM 

KIAA LQVI IS VSQLIADWGI GGTRFQQSLSI II^fCANSDRLIKHTSFSSBVKDLTKRIRTVLM 

rat LQVIISLSQLIADWGIGGTRFQQSI^IINNCANSDRLIKHTSFSSDVKDLTKRIRTVLM 

HC4 LQIIIAVSQLIADVALSGGSRFQESLFIINNFANSDRPMLARAFPAEVKDLTKRIRTVLM 
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ref 1.1 

1 31 

TCG ACT ATG AAT GCT GAT ACT GCT CCA ACA TCT CCT TGT CCT TCC ATA TCT TCC GAG AAC 
ser thr met asn ala asp thr ala pro thr ser pro cys pro ser ile ser ser gin asn 



61 91 

TCA AGC TCC TGC TCC AGC TTC CAG GAC CAG AAG ATC GCC AGC ATG TTC GAT CGG ACT TCC 
ser ser ser cys ser ser phe gin asp gin lys ile ala ser met phe asp arg thr ser 

121 151 

AGA GTA CCC GCC AGC AGC ACT TCC TCA CCG GGG CTC CTC TTC ACA GAA CTG GCT GCT GCC 
arg val pro ala ser ser thr ser ser pro gly leu leu phe thr glu leu ala ala ala 

181 211 

CTG GAT GCC GAA GGG GAA GGA ATC AGC GAA GTA CAA AGG AAA GCT GTC AGT GCA ATT CAC 
leu asp ala glu gly glu gly ile ser glu val gin arg lys ala val ser ala ile his 

241 271 

AGC CTG CTA AGT TCT CAC GAC CTG GAC CCA CGC TGT GTC AAA CCA GAG GTG AAG GTC AAA 
ser leu leu ser ser his asp leu asp pro arg cys val lys pro glu val lys val lys 

l301 331 

CiVTC GCC GCC CTT TAC CTA CCT TTA GTT GGC ATC ATT TTG GAT GCT TTG CCA CAG CTC TGT 
I /ile ala ala leu tyr leu pro leu val gly ile ile leu asp ala leu pro gin leu cys 



;1361 391 

'fiAC TTT ACA GTT GCA GAT ACT CGC AGA TAC CGC ACC AGT GGC TCG GAT GAA GAA CAA GAA 
lasp phe thr val ala asp thr arg arg tyr arg thr ser gly ser asp glu glu gin glu 

1^421 451 

NGGA GCC GGT GCC ATT ACC CAG AAT GTG GCT CTG GCC ATA GCA GGG AAT AAT TTC AAT TTG 
nCflv ala gly ala ile thr gin asn val ala leu ala ile ala gly asn asn phe asn leu 

I ref 2.1 
511 

;:JAA ACA AGT GGA ATA GTG CTG TCT TCC TTG CCC TAT AAG CAG TAC AAC ATG CTG AAC GCG 
■"f:iys thr ser gly ile val leu ser ser leu pro tyr lys gin tyr asn met leu asn ala 

541 571 

GAC ACT ACT CGC AAC CTC ATG ATC TGC TTC CTC TGG ATC ATG AAA AAT GCT GAT CAG AGC 
asp thr thr arg asn leu met ile cys phe leu trp ile met lys asn ala asp gin ser 

.601 631 

CTC ATT AGG AAG TGG ATT GCT GAC CTG CCA TCA ACG CAG CTC AAC AGG ATT TTA GAT CTA 
leu ile arg lys trp ile ala asp leu pro ser thr gin leu asn arg ile leu asp leu 

661 691 

CTT TTC ATC TGT GTG TTA TGT TTT GAG TAT AAG GGA AAA CAG AGT TCT GAC AAA GTC AGT 
leu phe ile cys val leu cys phe glu tyr lys gly lys gin ser ser asp lys val ser 

721 751 

ACC CAA GTC CTG CAG AAG TCA AGG GAT GTC AAG GCC CGG CTG GAA GAG GCT TTG CTG CGT 
thr gin val leu gin lys ser arg asp val lys ala .arg leu glu glu ala leu leu arg 

781 811 

GGG GAA GGG GCC AGA GGG GAG ATG ATG CGC CGC CGG GCT CCA GGG AAC GAC CGA TTT CCA 
gly glu gly ala arg gly glu met met arg arg arg ala pro gly asn asp arg phe pro 
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1341 871 

GGC CTA AAT GAA AAT TTG AGA TGG AAG AAA GAG CAG ACA CAT TGG CGG CAA GCT AAT GAG 
gly leu asn glu asn leu arg trp lys lys glu gin thr his trp arg gin ala asn glu 

901 931 

'AAG CTA GAT AAA ACA AAG GCC GAG TTA GAT CAA GAA GCC TTG ATC AGT GGC AAT CTG GCT 
iys leu asp lys thr lys ala glu leu asp gin glu ala leu ile ser gly asn leu ala 

961 991 

ACA GAA GCA CAT TTA ATC ATC CTG GAT ATG CAG GAA AAC ATT ATC CAG GCG AGC TCG GCT 
thr glu ala his leu ile ile leu asp met gin glu asn ile ile gin ala ser ser ala 

1021 1051 

CTG GAC TGT AAA GAC AGC CTG CTG GGA GGT GTT CTG AGG GTG CTG GTG AAT TCT CTG AAC 
leu asp cys lys asp ser leu leu gly gly val leu arg val leu val asn ser leu asn 

I ref 3,1 

1031 1111 4 

TGT GAT CAG AGT ACC ACC TAC CTG ACT CAC TGC TTT GCA ACA CTC CGT GCT CTC ATC GCC 
cys asp gin ser thr thr tyr leu thr his cys phe ala thr leu arg ala leu ile ala 



_1141 1171 

4^AG TTT GGA GAC TTA CTC TTC GAA GAG GAG GTG GAA CAG TGT TTC GAC CTA TGT CAC CAA 
|=^iys phe gly asp leu leu phe glu glu glu val glu gin cys phe asp leu cys his gin 

Ui201 1231 

ijfeTC CTG CAC CAC TGC AGC AGC AGC ATG GAT GTC ACC CGG AGC CAA GCC TGT GCC ACC CTT 

Lfyal leu his his cys ser ser ser met asp val thr arg ser gin ala cys ala thr leu 

;ii2 61 1291 

^;;TAC CTC CTC ATG AGG TTC AGT TTT GGA GCC ACC AGT AAT TTT GCA AGA GTA AAG ATG CAA 

r Xyr leu leu met arg phe ser phe gly ala thr ser asn phe ala arg val lys met gin 

rU 321 1351 

!="=^GTA ACC ATG TCC CTG GCA TCT TTG GTG GGA AGA GCA CCA GAC TTT AAT GAA GAG CAC CTG 
l#al thr met ser leu ala ser leu val gly arg ala pro asp phe asn glu glu his leu 



-a381 1411 

"kCA AGA TCC TTG AGG ACA ATT TTG GCC TAT TCA GAA GAG GAC ACA GCC ATG CAG ATG ACT 
arg arg ser leu arg thr ile leu ala tyr ser glu glu asp thr ala met gin met thr 

1441 1471 

CCT TTT CCC ACC CAG GTG GAG GAA CTT CTC TGT AAT CTG AAT AGC ATC TTA TAT GAC ACA 
pro phe pro thr gin val glu glu leu leu cys asn leu asn ser ile leu tyr asp thr 

1501 1531 

GTG AAA ATG AGG GAA TTT CAG GAA GAT CCT GAG ATG CTT ATG GAT CTC ATG TAC AGA ATT 
val lys met arg glu phe gin glu asp pro glu met leu met asp leu met tyr arg ile 

1561 1591 

GCC AAG AGT TAC CAG GCA TCT CCT GAT CTG CGG CTG ACC TGG CTC CAG AAC ATG GCA GAG 
ala lys ser tyr gin ala ser pro asp leu arg leu thr trp leu gin asn met ala glu 

1621 1651 

AAA CAC ACC AAG AAG AAG TGC TAC ACG GAG GCT GCC ATG TGC CTG GTG CAC GCC GCT GCG 
lys his thr lys lys lys cys tyr thr glu ala ala met cys leu val his ala ala ala 
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1681 

TTA GTG GCT GAG TAT CTG AGC ATG CTG GAG 
leu val ala glu tyr leu ser met leu glu 

1741 

AGC TTC CAG AAT ATT TCT TCC AAT GTG CTG 
ser phe gin asn ile ser ser asn val leu 

1801 

TCA CCT GAC GAG GAT GGG GTG TGC GCA GGC 
ser pro asp glu asp gly val cys ala gly 

1861 

CTC CTG GAG CAG GCC GOG GAG CTC TTC AGC 
leu leu glu gin ala ala glu leu phe ser 

1921 

GTC TAG AAG CTG GTC ATC CCC ATC CTA GAA 
val tyr lys leu val ile pro ile leu glu 

1981 

ACT CAC AGC AAG CTG CAG AGA GCC TTC GAC 
U:?thr his ser lys leu gin arg ala phe asp 

=^42041 

yTTT GGA ACC TAC TTC CGA GTT GGT TTC TTT 
rjlphe gly thr tyr phe arg val gly phe phe 

SJ21C1 

^:L;GAG TTT GTC TAC AAA GAG CCT GCA ATT ACC 
""^"glu phe val tyr lys glu pro ala ile thr 

rllGCA TTT TAT GGT CAA TGT TTT GGT GCA GAA 
y^ala phe tyr gly glr^ cys phe gly ala glu 

;;rGTG GAC AAA ACC AAG TTG GAT CCT AAC AAG 
"==^val asp lys thr lys leu asp pro asn lys 

2231 

TAC TTT GAT GAG TAT GAG ATG AAA GAC AGG 
tyr phe asp glu tyr glu met lys asp arg 

2341 

CGG AGG TTC ATG TAC ACC ACC CCG TTC ACC 
arg arg phe met tyr thr thr pro phe thr 

2401 

GAG CAG TAC AGA AGG AAC ACA GTC CTG ACC 
glu gin tyr arg arg asn thr val leu thr 

2461 

AGG ATC AGC GTC ATC CAG AAG GAG GAG TTT 
arg ile ser val ile gin lys glu glu phe 



1711 

GAC CAC AGC TAC CTG CCC GTG GGC AGT GTC 
asp his ser tyr leu pro val gly ser val 

1771 

GAG GAG TCT GTG GTC TCT GAG GAC ACC CTG 
glu glu ser val val ser glu asp thr leu 

1831 

CAG TAC TTC ACC GAG AGT GGC CTG GTA GGC 
gin tyr phe thr glu ser gly leu val gly 

1891 

ACG GGA GGC TTA TAT GAG ACA GTT AAT GAG 
thr gly gly leu tyr glu thr val asn glu 

1951 

GCG CAT CGA GAA TTC CGG AAG CTG ACA CTC 
ala his arg glu phe arg lys leu thr leu 

2011 

AGC ATC GTT AAC AAG GAT CAT AAG AGA ATG 
ser ile val asn lys asp his lys arg met 

2071 

GGA TCC AAA TTT GGG GAT TTG GAT GAA CAG 
gly ser lys phe gly asp leu asp glu gin 

2131 

AAG CTT CCT GAG ATC TCA CAT AGA CTA GAG 
lys leu pro glu ile ser his arg leu glu 

2191 

TTT GTG GA?^ GTG ATT AAA GAC TCC ACT CCT 
phe val glu val ile lys asp ser thr pro 

2251 

GCC TAC ATA CAG ATC ACT TTT GTG GAG CCC 
ala tyr ile gin ile thr phe val glu pro 

2311 

GTC ACA TAC TTT GAG AAG AAT TTC AAC CTC 
val thr tyr phe glu lys asn phe asn leu 

2371 

CTG GAG GGG CGG CCT CGG GGA GAG CTG CAT 
leu glu gly arg pro arg gly glu leu his 

2431 

ACT ATG CAC GCC TTC CCC TAC ATC AAG ACC 
thr met his ala phe pro tyr ile lys thr 

2491 

GTT TTG ACA CCG ATT GAA GTT GCC ATT GAA 
val leu thr pro ile glu val ala ile glu 
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2521 

GAC ATG AAG AAG AAG ACC CTG CAG TTA GCA 
asp met lys lys lys thr leu gin leu ala 

2581 

AAG ATG CTT CAG ATG GTG CTG CAA GGC TCT 
lys met leu gin met val leu gin gly ser 

2641 

GAA GTA GCC CAA GTG TTT TTG GCT GAA ATT 
glu val ala gin val phe leu ala glu ile 

2701 

AAC AAG TTG AGG TTA TGC TTT AAG GAA TTC 

asn lys leu arg leu cys phe lys glu phe 
2761 

AAC AAG CGT CTC ATC ACG GCA GAC CAG AGG 
asn lys arg leu ile thr ala asp gin arg 

2821 

^|AAC AAG CTA AAA GAG AAC CTC AGG CCA ATG 
5asn lys leu lys glu asn leu arg pro met 

^2881 

JJCCA ATA TTC AGA GTT GAG AGT CAA AAG AGG 
^"pro ile phe arg val glu ser gin lys arg 

M2941 

:|TGT GAA ACC CAG TTG TCA CAG GGC AGC TAA 
cys glu thr gin leu ser gin gly ser OCH 

Iref 5.1 

'fC-CC CTG CAA CCC TGG AGA AGG ACT TGC TGG 
y 3061 

3 GAC TGA CTG TAG ACT CCC TGA TCA GCC AGC 
" 3121 

TGG AAT TAT TCC CAA ATG GAC TCT GAC CAG 
3181 

GAG GAT GGG TAC TCA GGC ATG ACT GCG TAT 
3241 

AAA CAA GGC ATA AGC AGC TTC TCC TGC TGA 
3301 

TTT CCT CTG GCC CAT ATT TGA ATT TAT TGG 
3361 

AAA ATT ATC CAC CAG TCG ATT CAA ACT GAA 
3421 

TTG TAG GAG TAC GAA ACA TTT TCA ATA AAT 
3481 



2551 

GTT GCC ATT AAC CAG GAG CCG CCT GAT GCA 
val ala ile asn gin glu pro pro asp ala 

2611 

GTG GGA GCT ACT GTA AAT CAG GGA CCA CTG 
val gly ala thr val asn gin gly pro leu 

2671 

CCT GCT GAT CCA AAA CTC TAT CGA CAT CAC 
pro ala asp pro lys leu tyr arg his his 

2731 

ATC ATG AGA TGT GGT GAA GCT GTA GAG AAA 
ile met arg cys gly glu ala val glu lys 

2791 

GAA TAT CAG CAG GAA CTC AAA AAG AAC TAT 
glu tyr gin gin glu leu lys lys asn tyr 

2851 

ATC GAG CGG AAA ATT CCA GAA CTG TAC AAG 
ile glu arg lys ile pro glu leu tyr lys 

2911 

GAC TCC TTC CAC AGA TCT AGT TTC AGG AAA 
asp ser phe his arg ser ser phe arg lys 

2971 

GAA AAG CCA TCT TCA TTC GTG GAG ACT GTG 



3031 

TAC TTA AAA AAT GGG ACA TTT GCC ACC CAG 
3091 

ACT CTG GAA GCT TTG GGA TCC CAG GAA CCA 
3151 

ATT TTT GCC ATA CTG GGG GGT GGC GGG ATG 
3211 

TTA TTA AAG TGT GTT TTT CCA CAA TGT ACC 
3271 

CTG GCC AAT CAC TGC CCA TCT GAG AGA TGA 
3331 

AGT AAC TCA AAT TGC CTG AGG AAA AAT GGA 
3391 

TTT CAC TCT TTA TAG GAA GGC AGG GCA AAC 
3451 

CTA CAA AGG GAA GCC TTA CTA CAA TTC CAA 
3511 



A 
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AM, TCA TCA TGG TTG GAA ATT TGG GAG GAG ATT ATT TGT GAA CTT GTT ACC CTT TTG GTA 
3541 3571 

ATG GTG GAG TAA TTG CTG TAT AGT TAT TTT TGT TTT ATT ATT ACT GTT ACA TTA ATT TAA 
3601 3631 

CAT GCA TTT ATA GAA GAA TAG ATT CAA AGC ACT GAT GTA GGA GAT ACA CGG TAC TTG GAG 
3661 3691 

CAG TCA GCC AAA AAT CAC AGA TAC TGC TTT CAC TTA AAT GGA AAC AAT TCT CCG ATA ATG 
3721 3751 

CTT TGC TTT TTT TCT TAT GTC ACT CTT GTG TAC TAT CTA TTT TTC TCC TCT CTG GGA CCA 
3781 3811 

AGT TTC TTT TTA TAA AGC AAT AAT ATC TCT GTT TTC ATT TCA GAA CAT TGT GCT GTC TGT 
3841 3871 

CAG CAT ATG TAT ATC AGC TAC AAA ATA TAT TCA ACT TTG ACT TCT TTT GAC AAA GGA CTT 
3901 3931 

TAG GAA AAG GAG GAA CAA AGA CAT TAT TTG AGA ATT AAA TTA TAT ATT TTT AAT ATG ACT 
33961 3991 

J GTG ACC TTG ACT GAT AAT AAA GAT GTA ATA AGA ATT GCA AGC TAA AAA AAA AAA AAA AAA 

-r|4021 
fiAAC TCG 



References 

BAG sequences of Human CLASP 5 
TjRef 1-1 

:| Sequence of BAC19 using primer HC5S1 1, which spans nucleotides 3-22 of the cDNA. E\on 

--sequence is underlined and represents nucleotides 32-57. 
CTCTCTGTcm^CATATCTTCCAG GTTATAAAGNATTATTTACTAAAAGA^ 
TATTTCATTTAACrAGCTCAGTTTAATCATGTATTTCCTATAAAGGTTAGTC^ 
TTGACAAAACAATCAAACAATTCAAACCAGATCAAGTATGCTACCCTGAAGTTACACC 
ACTAGCTAAGAATTAACAATCTAAGTAATTGGTTTCTCCCCAGGCTCAAGGCT 
TCAGGTTAAGTAAAGCCAAGAATCCAATAAGCCCTATGAAATTTAGAAACTCATAGAA 
AAGTCTCAAATCTTCTTGTCTGACATTAGCCAATTGTTATATTATGCAAATAGAGGAT^ 
NCAAGTAAATAAGTTTGGAACCrGTTTACCAGGTTTTTGCAGCAG 

AACTGGTCATGCATTGAATGCCGAGAGCAAAGAGGAATGGAGAGGGGNTGTAAGNGG 

TTCCAAT^m^ACTGGAACCCACCACTATCIT^NGAAGTC^^ 

GNCTCnTTAGGCCTOTANTAAOTAGAATCTATATGGATTC^^ 

TGCCTATGAAA 



A 
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Ref2.1 



Sequence of BAC19 using primer HC5AS 10b, which spans nucleotides 560-580 of the cDNA. 
Exon sequence is underlined and represents nucleotides 510-5^. 

TGCGAGTAGTGTCCGCGTTCAGCATGTTGTAGTGCTTATAGGGCT GAAGGGAOrTrArn 

ATTGGGGGATGGAGGCCAGGGAAGAAGTCAAGCAACAGAAAAATTTGAGGCTTAACA 

GTCAAGCAACAGAAAAATTCAAAGTGTTCTCTTAAAATACCATGACTGTACATCACTG 

CTAGGCTGGAGATCTATTGCCAGTAGCCCrGCCTTCCCTAGGCAGGGGAAGCTGTGTT 

CTTTGAGTAGCGCTACTCAGCAAAGAGGCTCACCTGGGGCAGTATTTGAGCTAGGCTT 

TCAGCCACCGTATCTGAGTACCTCTGTCTTANGAGCAGTGTGGCCTGGTGATCACCCCT 

GGGCCTTGATCATGCGTGCTGCAATCCCAGTGATACAAAGAGGCTTTCATGCTGCTAA 

GATCTCCAAGTATTTCTCCTTCGTGCTGGGCAGCAGAGGGTTAGACTTNCAGGGGAGA 

AGGAAACTGGCTGGGTGCCATGAATAANCTTGCTGTTCAAGA^^TAACTTC^TGTTAC 

ATAAGNGCAAAGGTATAACATAAAGGGNCATGAACTGCTCAACNAAATTNATCAAAT 

CCATGTTTGTGGGAGTTCTTTTGTNATNGGAAGTTTAACCCCTAA 

;;:{Ref3.i 

^iSequence of BAC13 using primer C5S3, which spans nucleotides 1086-1 105 of the cDNA. Exon 
uysequence is underlined and represents nucleotides 1 1 10-1 120. 
;; fccCNGCTCrTTTTGGCAA NGTAANCTTGGGATGCTTGriMM( 

:|nAAGAN1-1"1"1"1'1UAGCTTCATACITCTCTCTTCAGGGGGACCAAAAGTCACAGAGCATA 

:otaagtggcanaaccccnaaggtcrtaagtcrtcctaggaagaaagcagatgccctga 
;\ttctgtgggaagccaccatggagaggaaaagcagtggctcccatatttgaagtgngga 
mccraactctagaaagtttaaaanggccatttgctgaagggctatgacatgagaacaga 

hlGATCAACTGAGTGACTTAGCAArm'CACTCriTCTCTGTAANACCrCTGGTGAGTGAGA 

■;:j^^^aaatcct^^'atgtgacgcccattagtcttacaaaangtcatgccntaaaatgccan 
j^gaaggncagaaatgaatttctcaccgccngaggaatgaggattatnctggggggaca 
"tgcanaaatattnnncccccnatttattnatttatttatttttgagacngagtntcgnt 
ctaatcgcccccaggctggnaggtggnaggtggttcccatcttnaancttanntngga 
aggncctctttgngccccnggggggngnaaagngaattccctaaatgcctncannccc 
ctccctggangttatttgggggnnttntaaagggcngtggcng 



Sequence of BAC13 using primer C5S7, which spans nucleotides 2196-2205 of the cDNA. . Exon 
sequence is underlined and represents nucleotides 2225-2231. 

ACA A A AACT AACCATCANTCTCTAA ATCCr A ACA A N( : I " 1 ' n ' I ' f 1 A A G A AT A PPT A A NG 

AGCTCAACNAGGGGGACrNTCCAANGCACTTAAATGCAGNCAAACNACNCCNNCAAG 

AGNGGCAACTACTAATGGGGCANATCTNAAAGAAAATATAGNCAAAGGNNGGAATCA 

TAATAGGAGCNACCACTTANGAAGCACCAACTGGGGACCTGGAACTGNATNAGGNCC 

TCTACATACATCATNNNATANCATCCTGCAACGACCCCTGGAAGGAGANAGANGGNA 



Ref4.1 
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TTCCTANNNTAGAGANGAGANAACTGGGGACATGGGAAGAGGNAAGCGAAGGGTTCA 

AGGGGANGNAAGCGAGCAGANNCCAGGGNCrCANACTNGNGGGGNNTGGGGGGNTN 

CTGNNNCCCTACNCTTNGNANGAACAGNGNNGTTGANNTGGCTTTNGANTA 

RefS.l 

Sequence of BAC13 using primer 122047FI, which spans nucleotides 3537-3556 of the cDNA. 
Exon sequence is underlined and represents nucleotides 3000-3492. This region does not contain a 
intron, in this region 

CCANNAGATT^r^^GNAACGNNGGTAGGCTTCCTTTGTAGATT^ATTGAAAATGTTTCGT 

ACTTCTACAAGTTTGCCCTGCCITCCTATAAAGAGTGAAANTCANTTTGAATCGACTOG 

TGGATAATTNTTCCATTTTTCCTCAGGCAATTTNGAGTTACrCCAATAAATTCAAATAT 

GGGCCAGAGGAAATCATCnrCAGATGGGCAGTGATTGGCCAGTCAGCAGGAGAAGC 

TGCTTATGCCTTGTTTGGTACATTGTGGAAAAACACACTTTAATAAATACGCANTCATG 

CCTGAGTACCCATCCTCCATCCCGCCACCCCCCAGTATGGCAAAAATCTGGTCAGAGT 

CCATTTGGGAATAATTCCATGGTTCCTGGGATCCCAAAGCTTCCAGAAGTGCTGGCTG 

; A\TCAANGGAGTGTACAGTCAGTCCTGGGTGGCAAAAATGTCCCATTTTT^ 

H JAGCAAAGGTTCCTTCTTNCAAGGGTTNCTAGGGCC 



A 



FIG. 4 
7 of? 



Figure 

Multiple sequence alignment of Human CLASP proteins with intron/exon borders 
indicated by a vertical line. Numbers in right margin correspond to References 



HC2A 

KIAA ASGraJ^KNARFSAJYHQDSKKLSNBmUOJJ^ 

rat 

HC4 

HCl 

HC3 

HC5 



HC2A 

KIAA FPirrWSSYIPTKQFETCSKTPITFEVEEFVPCIPKHTQpyTiyT>^ 

rat 

HC4 

HCl 

HC3 

HC5 



HC2A VLHHHQNPEFYDEIK 

KIAA KSKfiiCAJfO^IAICIEJFKDSDKEDSQPIJCCiyGRP 

rat 

HC4 

HCl 

: HC3 

; HC5 



HC2A IELPTQLI£EKHHLLLTFFHVSCDNSSKGSTKKRDVVETQVGYSWLPLLKDGRVVTSEQHI 

^ KXAA I£IJ>TQIJIEKHHTJ.T.TFraVSCDNSSKGSTiqgiPVVETQVGYSWLPIiIJ^ 

irat 

HC4 

, HCl 

I HC3 

; HC5 



fHC2A PVSANLPSGYLGYQELCa^GRHYGPEIKWVDGGKPLIJCISTHLVSTVYTQDQHLHNFFQyC 

I KIAA PVSANLPSGYLGYQEIX3*GRHYGPEIKiraDGGKPIJJfISTHlVS 

rat ^ 

HC4 

HCl 

HC3 GPGPARSTVSISLISNSARV 

HC5 



HC2A QKTESGAQALGNELVKYLKSLHAMEGHVMIAFLPTIIJJQLFRVLT-RATQEEVAVN^ 

KIAA QKTESGAOAIXSNELVKYLKSLHAMEGHVMIAFLPTIIJ^QL 

rat 

HC4 MEIQVLIRFLSVILMQLFSiVLPNMIHEDDVPISCPMV 

HCl MSFLPIILNQLFKVLV-QNEEDEITTTVTRV 

HC3 NRSRSLSNSNPDISGTPTSPDDEVRSI IGSKGLDRSNSWVNTGGPKAAPWGSNPSPSAES 

HC5 



B 
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H02A 

rat 
HC4 
HCl 
HC3 
HC5 



HC2A 

KIAA 

rat 

HC4 

HCl 

HC3 

HC5 



HC2A 
KIAA 
rat 
HC4 

HCl 
HC3 

HC5 



HC2A 

KIAA 

rat 

HC4 

HCl 

HC3 

HCS 



HC2A 

KIA^ 

rat 

HC4 

HCl 

HC3 

HCS 



HC2A 

KIAA 

rat 

HC4 

HCl 

HC3 

HCS 



IIHV'VAOtrHEEGLESHLRSYVKrAyKAEPW.t^EVKr/HEELTKiMT^ 
1 1 HV^/A^H£XGI^5F^Sr/KyAYKAE?r/ASEyKr/HEEirKSM7^ 

LrHIVSKCHEEGLDSYLSSFIKYSrePGKPSAPQAPLIHETlATMMIALUCQSADriJklN 

LPDIVAKCHEEQLDHSVQSYIKFVFKTR ACKERPVHEDLAKNV^GLLK-SNDSPrv^ 

TQAMDRSCK?>lSSHTETSSrLQTLTGKL? TKKLFHEELALQJryVCSG--SVR — -E 

Cadherin 



KLLRYSWFFFDVLIKSMAQHLIENSKVKLL 
KLLKYS W FFFDVL I KSMAQHL I ENSKVKLI 


RKQP 
RNQR 


FPASYHHAAETW>^LMPHITCKFGD 
FPASYHHAVETWNMLMPHITQKFRD 


KLLKYSWFFFEI IAKSHATYLLEENKIKL1 
HVL^fesW FF FAII LKSMAQHLI DTNK I QL£ 
SALOQAW FFFEI>|VKSMVHHLY FNDKLEAf 


HGOP 
RPQP 
RKSP 


FPKAYHHALHSLFLAIT- 1 VESQYAE 
FPESYQNELDNLVMVLSDHVIWKYKD 
FPERFMDDIAALVSTIASDIVSRFQK 



NPEASKNANHSLAVFIKRCFTF^5DRGFVFKQIN NYIS— CKAPGDE^TLFEYKFEFL 

NPEASKNANHSLAVFIKRCFTFMDRGFVFKQIN NY I S —CFAPGDPKTiKEYKFEFL 



IPKESRlTVNYSLASFIxKCCLTLMDRGFVFNLIN DYIS — GFSPKDPKVLAEYKFEFL 

AJiEKTRRATHSVARFU^CFTFHDRGCVFKKVN NYIS — MFSSGDLKTLCQYKFDFL 

DTEMVERLNTSIAFFLNDLLSVMDRGFVFSLIKSCYKQVSSKLYSLPNPSVljvSLB^ 



RWCNHEHYIPLNLPM PFGKGRI^R YQDLC- DYSLTrEr 

K WCNSHY I PLNLrM ? FGKGRI :}R Y^DLQL DYSLTrEF 



::ticnhehyi?lni?m afpj^pklc?^ vor-sxi eysisiev 

QEVCCHEH F I PLC LP I RS AN I PDPLTPSE5 TQELHAS E^IPE YSVTNE F 

RIICSHEHYVTLNLFCSLLTPPASPSPSVSSAI^QSSGFSTN'/QDQKIANMFELS— VPF . 
MNADTAPTSPCPSIS Sqf^SSSCSSFQDQKIASMFDRTSRVPA 

Cadherin 



EC motif 



CHi^HFLVGL 
CRNHFLVGI 


LLRE 
LLRE 


VGTALQEFRE VRLIAIS v'LKNLLIKHSFDrRYASFjsHQARIAT 

VGTALQEFRE V?.L lAI S v'T.KNLLIKHS FDDRYASRSHQARI AT 


CKHHFLVGI 
CRKHFL3GI 
RQQHYLAGI 
SSTS-SPGL 


LLR£ 
LLRE 

LFTi 


TS I ALQDNYE 1 RYTAI S VI KNX.LI KHAFDTRYQHKNQQAKI AQ 

VGFALQEDQD VRHLAIAVLKNLMAKHS FDDRYREPRKQAQ IAS 

LAVILDPDAEGLFGLHKKVINMVHNLLSSHDSDPRYSDPQIKARVAM 
LAAALDAEGEGISEVQRKAVSAIHSLLSSHDLDPRCTKPEVKVKIAA 



LYLPLFGLLIENVQRIKVRDVSPFPVKAG-MTVKDESLALPAVNPLVTPQKGSTLDNSLH 
LYLPI^iJ^IENVQRINVRBVSPFPVKAG-MTVKDESIALPAVNPLVTPQKGSTm 



LYLPFVGLLLENIQRLAGRDTLYSCAAMPNSASRDEFPCG FTSP— A3J— R35LS 

LYMPLYGMLLDNMPRIYlJ<DLYPFTVNTSN(fcsRDDLSTNGGFQSQTAIKHANSVDT£FS 

LYLPLIGIIMETVPQLYDFTETHNQRGRPICIATDrY*ESE SG 5MI5 

LYLPLVGIILDALPQLCDFTVADTRRYR TSGSDEEQE GA GP.IT 



B 
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hc:a 

rat ' 

HCl 
HC3 
HC5 



k: LLaAiSG4^?YTTiT?N"S5VTlNAr>5RGSLI5TDSGNSLP£RNS£KSNSLDi^QQ£S 
yc:^hLGAlSGl]KS?Y7rSZ?}i:SS'^mDSRGSLlS7DSGSSLPIJ<}iSZKSnSW}QiOQSS 



TDKDTAYGSFQNG-- 
KDVLNSi;>^FSS~- 
C-TVAMA I AGTS VPQ- 
iN''>^AlJM AGKN FN — 



HG I KBS DS RGS L I P - EGATG FPDQGNTGEN TRQS 

• lAISTVNHADSBASLASLDSNPSTNEKSSEKTDNCEKIPRPL 

LTRPGSFLLTStSGRQHT 

LKTSG-IVLS3LPYKQYN / 



HCIA TlGNSv^llCDKLDQSEIKSLIJ^Cr^YIlKSMSDDALFTYWN-KASTSELMDFFTISEVCl 

KIAA TLGNSVVKCDKIJDQSEIKSLlWCFLyiIJ<SMSDDALFTYWN-KASTS£IJ4D^ 

rat 

HC4 STRSSVSQYNHLDQYEIP^LLMCYLYIVKMISEDTLLTYWN-KVSPQELINILILLEVCL 

HCl aligstlrfdrldqaetrslu^cfi^.imktisyetliaywq-rapspevsdffsiiIdvcl 

HC3 TFSA£SSRSLLICLL)m.KN-ADETVLQKWFTDI^VI.QIJ^RLLDLl}YL^^ 

HC5 MLNADTrRNLMICFLWIHKN-ADQSLIRKWIADLPSTQLNRILDLLFICV 



HC2A HQEjbYMGKRY lAKNQEGLG— PIVHDRKS QTLPVSRNRTGMM 

KIAA HQFQYMGKRYIAR TGMM 

rat 

HC4 FHFRYMGKRNIARVHDAWLSKHPGIDRKS QTMPALRNRSGVM 

HCl C^FRYMKRNI IRKIAAAF--KFyQSTC»WGTIJCGSHPSCQTSGIJAQ«MHSTSRHEGK^ 

HC3 SCFEYKGKKVFERMNSLTFK--KSKZ3MRAK LEEAIiGSIGARQEMV 

HC5 LCFEYKGKQSSDKVSTCVLQ— KSRDVKAR LEEALLRGEGARGEMM 



HC2A HARLQCL SSLDKS LTFNHSYGHSDADVX.HQSLLEANIATEVC 

KIAA rlARLQQL GSLDNS LTFNHSYGHSnADVMQSLLEANIAT£VC 

rat 

HC4 ;ARLQHL SSLESS FTLNHSSTTTEADirHQALLEGNTATEVS 

Hc 1 :hrs ct L ? I : RGK N.ALSNPKL LQMLDNTMI^NSNE IDI VHHVPTEAN I ategc 

HC 3 ?ilS RGQL^RS PS GSAFGS QENLPiJRKmTHWRQNTEKLDKSRAE I EHEAL I DGNLATEAN 

HC5 PilRAPGNDRFP GI^ENLKifKKEQTHWRQANEKLDKTKAELI>QEAI.ISGNLATEAH 



I. HC2 A LTALDTLSL FTIAFl(NCI-t^HGHNPIMaCVFDVYU:FLQKHQSETALra Y 

KIAA LTALBTLSLFTIAFKNCIJaADHGHNPIiiKKVFDVyLCFLQKHQSE Y 

rat klsrghsplmkkvfdvylcflqkhosemalknvftalrsliv 

HC 4 LTVLDT I S r FTQC FKTH rLKNI<5HNPL>4KKVFDI HLAFLKNGQSEVSLKHVFAS LRAF 1 5 

UJ HCl IT I ldlvslftqthqrclqqcdcqnslmkrgfdtymlffqvnqsatauot/fasuu.^ 

Q HC 3 L : I L DT LE I WQTVS - - VIES - -KES I LGGVLKVLLHSMACNQSAVYLQHCFATQRALVS 

Q HC5 LI ILDMQEKI IQASS--ALDC--KDSLLGGVIi<VLVNSIJ^CDQSTTYLTHCF;^IJ^ 3 . / 

HC2A KFPSTFyEGRAmCAALCyEIIJCCCNSKI^SIRTEASQU.YFIMlNNFDYTGKKSr»mT 

KIAA KFPS TFYEGRAHMCAALCYE ILKCCNSKI*SS IRTEASQXiLYFIiiRNNFDy TGKKSFVRTH 

rat KFPSTFYEGRAIWCASLCYEYIJCCCNSKLSSIRTEASQLLyFIMRNNFDyTGKKSr^ 

HC A KFPSAFFKGRVNMCAAFCYEVLKCCTSKISSTRKEASALLYLI/lRNNFEyTK^ 

HC 1 KFPSAFFQGPADLCGS FC YEVLKCCNHRSRSTQTEASALLYXEMRKNFE FNKQKS IVRSH 

HC3 5^PELLFEEETEQCADLCLRLIJ^CSSSIGTIRSHPSASLYLI^C»l51EIGN--HrARVK 

HC5 KFGDLLFEEEVEQCFDLCHQVLHHCSSSMDVTRSQACATLYLLMRFSFGATS~NFARVK 



HC2A LQVI ISVSQLIADWGIGETRFQQSI^IINNCANSDRLIKHTSFSSDVKDLTKRIRTVLM 

KIAA LQVI ISYSQLIADWGIGGTRFQQSLSIINNCANSDJU^IKHTSFSSBVKDLTKRIRTVLM 

rat l^VIISLSQLIADWGIGGTRFQQSI^IINNCANSDRLIKHTSFSSDVKDLTKRIRTVLM 

HC4 LQI I lAVSQLIADVALSGGSRFQESLFI INNFANSDRPMLARAFFAEVKPLTKRIRTVLM 

HCl LQplKAVSQLIAD-AGIGGSRFQHSIAITNNFANGDKQMKJ^NFPAfn^ 

HC3 MQVPMSLSSLVGTSQNFKEEFLRRSLKTILTYAEEDLELRETTFPDQVQDLVFNLHMI LS 

HC5 MQVTMSlASLVGRAPDFNEEHIilRSIJlTIIAYSEEDTAMC^TPFPTQVEELLCNLNSILY 
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HC2A 

rat 
HC4 
HCl 
HC3 
KCS 



T ransmembrane 

ATA0MKEHENDPEMLVT>LQYSUU<SYASTPELiU<TWX.DSMARIHV7^ 
ATA<>lKEHENDPEmVDI.QYSIJUCSYASTPELRKTWI^SMARIHVKNGDI^E^ 
A7AQt^iiEm>P€HLVDU:iYSJJiKSYASTP^ LSEAAMCYVHV 
ATAQMKEHEKDPEMLIDIK^YSIJ^YASTPELRKTWLDSMAKIHVKNGI iFSEAAMCYVHV 
ATM3MKEHEKDPEMLVDLQYSIJWSYASTPEIJU^TWLESMAKIHARNGI iLSE^>tAMCYIHI 
DTVKMKEH(^PEKLIDLMYRIAKGYQTSPDLRLTWL(»«^ HAEAAQCLVHS 
DTVKMMFQ3E:DPEMIM3IJ*YRIAKSYQASPDIJILTWI^^ YTEAAMCLVHA 



HC2A 

KIAA 

rat 

HC4 

HCl 

HC3 

HC5 



dOBiain 



SH3 



TALVAEYirRKGV— 
TALYAEYI rRKEA— 



TALVAEYl lEKEJU) -lAK^EPPVFPYSHTSCQRK 3IUX31FRQ(3CTAFRVrTPN 



AALVAEFI BRKKL — 



FRQGCTAFRVITPN 

VQI«EFPI»LPHSHSACI*R1^RGGVFRQGCTAFRVI TPN 



AALVMYIS4LED 



AALIAEYI KRKGYUKVEKIC ^RSJULSEDTHPCDSNSLLTTP SGGSMFSMSRPAFLSITPN 

AALVAEYqSKLED : — BKYLPVGCVTFC^tSSN 

-HS YLPVGSVSFQNI SSN 



FPlftGCSAFKKITPN 



ITAM 



HC2A 

KIAA 

rat 

HC4 

HCl 

KC3 

HC5 



HC2A 

KIAA 

rat 

HC4 

HCl 

HC3 

HC5 



IDEEASMMEDVCMQD VHFNEDVXMELLEQCAPGLWKAEI YELIAPIYKLIjIPI 

I DEEASMMEDVC21QD VHFHEDVXJffiLIXQCADGLWKAE HYELIADI YKL j l P I 

I DEEASMMEDVGMQD VHFWEDVIMEI.LEQCADGLWKAE RLRAG LLTSIKSSSP 

I DEEGAMKEDAQ4MD VHYSEEVLLELLEQCVNGLWKAE^YETiSE I SKL I GP I 

IKEEGAAKEDSQfflP TPYNE^ILVEQl.YMCGEFLWKSElt[HiiAEW^ lAV 



VI^£SAVSDDVVSPD££GICSGKYFT£SGLVGLL£QAAASFSMAGMYEAVN£ VYKV1. IPI 

vleeswsedtlspdeix;vcagqyftesglvglleoaaelfstgglyetvne\^l3i P I 

ITAM ITAM ITAM ITAM 

YEKPfiD • — ' 

YEKRRDFEMAHI^Dl^HRrfYSi^^ 

SMKSGGTLETTHL f DTI HRF YSKV TEVTTR A AGSWDLLPGGLFGQ 

YENRREFENLTQV fKTI HGA YTK3 LEVMHTKKRLLG TFFRVA F^'GQ 

FEKQRDFKKI^DIgYDj HRS lYLKM AEWNSEKRL FG FjYYR\|AFYGQ 
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CGGTAACCGCCATTTTGTCTCCTGTAACAATTTACGCGCCGTGTAACTGTGAATCTTTCAAAGCCTCAGTTTTATGACC 
CTGTGGAGCCAGTGGACTTTGAAGGACTTCTG -1 
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GAC TTC ACT GAT 
asp phe thr asp 

CCC TCT TTG CCG 
pro ser leu pro 

TAC ATC CGT GAG 
tyr ile arg glu 

TTT AAA AAG ACT 
phe lys lys thr 

TCG GAA ACC TTG 
ser glu thr leu 

CTG TGC GAC GTG 
leu cys asp val 

CAG CCT GAC AAG 
gin pro asp lys 

CAG AAC GAG GAG 
gin asn glu glu 

GTG GAC GAG GAG 
val asp glu glu 

GGC AAC AGA ATA 
gly asn arg ile 

TTT GCC AGC ATT 
phe aia ser ile 

TGT GAC CTG AAC 
cys asp leu asn 

GCA TCA AGT CAG 
ala ser ser gin 

CTG GTA GTC AAG 
leu val val lys 

TAC ACG GTT ATC 
tyr thr val ile 

CTC CAA GCT GAA 
leu gin ala glu 

CCC ATA AGC TTA 
pro ile ser leu 
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1021/341 

TCA AGC TTC TTC AAT GTC TCC ACC CTT GAG 
ser ser phe phe asn val ser thr leu glu 
1081/361 

GGG AGA AGC CCA GTG GGT GAA CGG AGG ACA 
gly arg ser pro val gly glu arg arg thr 
1141/381 

GCC CTC TCC TTG GAG GAA AAT GGG GTT GGA 
ala . leu ser leu glu glu asn gly val gly 
1201/401 

AGC AGC TTT TTC AAG CAG GAA GGA GAT CGC 
ser ser phe phe lys gin glu gly asp arg 
1261/421 

GCT GAC TAC AAA AGA TCA TCA TCC TTA CAG 
ala asp tyr lys arg ser ser ser leu gin 
1321/441 - 

AGA CTG GAG ATT TCT ACA GCT CCA GAG ATC 
arg leu glu ile ser thr ala pro glu ile 
1381/461 

CCC GTG AAA CCC TTT CCT GAA AAC CGG ACA 
pro val lys pro phe pro glu asn arg thr 

r-l 1441/481 

ACA CGA GAA GTA TAT GTC CCT CAC ACT GTG 

TIt thr arg glu val tyr val pro his thr val 

,1 1501/501 

;:^f AGG CTG AAC TTT GTA AAC AAA CTA GCA TCA 
'1: arg leu asn phe val asn lys leu ala ser 
1561/521 

Cfl ATG TGT GGA GAA GAT GCT AGC AAT GCG ATG 
Q met cys gly glu asp ala ser asn ala met 
1621/541 

U^^: GAA TTT CTG CAG GAA GTG TAC ACA GCT GTT 
ni glu phe leu gin glu val tyr thr ala val 
\2 1681/561 

GAA GAA GTG AAA ATT AAG CTC CCC GCT AAG 
;2 glu glu val lys ile lys leu pro ala lys 
:;!f 1741/581 

hJ TTC TAC CAT ATC AGC TGT CAG CAG AAG CAA 
■ phe tyr his ile ser cys gin gin lys gin 
1801/601 

TCA TGG CTG CCA ATT CTC TTA AAT GAA CGT 
ser trp leu pro ile leu leu asn glu arg 
1861/621 

GCC TTG GAA AAA TTG CCA CCC AAC TAC TCC 
ala leu glu lys leu pro pro asn tyr ser 
1921/641 

AAT CCT CCC ATT AAG TGG GCT GAA GGA CAT 
asn pro pro ile lys trp ala glu gly his 
1981/661 

GTT TCT TCT GTA CAC ACC CAG GAC AAC CAC 
val ser ser val his thr gin asp asn his 
2041/681 

CTG GAG AGC CAG GTG ACC TTC CCC ATC CGC 
leu glu ser gin val thr phe pro ile arg 
2101/701 

CTG GAG CAT GAG CTG AAG CTC AGC ATC ATC 
leu glu his glu leu lys leu ser ile ile 




1051/351 

AGG GAG GTA ACT GAT GTG GAC TCT GTG GTT 
arg glu val thr asp val asp ser val val 
1111/371 

TTG GCC CAA TCT AGA AGG CTT TCT GAA AGA 
leu ala gin ser arg arg leu ser glu arg 
1171/391 

TCC AAC TTC AAA ACC TCC ACT CTG AGC GTT 
ser asn phe lys thr ser thr leu ser val 
1231/411 

CTT AGC GAT GAA GAC TTA TTC AAG TTT TTA 
leu ser asp glu asp leu phe lys phe leu 
1291/431 

AGA CGA GTC AAG TCA ATT CCA GGC TTG CTA 
arg arg val lys ser ile pro gly leu leu 
1351/451 

ATC AAT TGC TGT CTG ACT CCT GAA ATG CTG 
ile asn cys cys leu thr pro glu met leu 
1411/471 

CGC CCG CAC AAA GAG ATT TTG GAA TTT CCA 
arg pro his lys glu ile leu glu phe pro 
1471/491 

TAC AGA AAC CTT CTC TAT GTC TAC CCA CAG 
tyr arg asn leu leu tyr val tyr pro gin 
1531/511 

GCC CGG AAC ATT ACA ATA AAG ATC CAG TTT 
ala arg asn ile thr ile lys ile gin phe 
1591/531 

CCG GTC ATC TTT GGA AAA TCC AGC GGG CCT 
pro val ile phe gly lys ser ser gly pro 
1651/551 

ACA TAC CAT AAT AAG TCT CCT GAC TTT TAT 
thr tyr his asn lys ser pro asp phe tyr 
1711/571 

CTC ACA GTA AAT CAC CAC CTC CTG TTC ACC 
leu thr val asn his his leu leu phe thr 
1771/591 

GGA GCC TCC GTG GAA ACT CTC CTG GGA TAT 
gly ala ser val glu thr leu leu gly tyr 
1831/611 

CTT CAA ACT GGA TCC TAC TGT CTC CCA GTT 
leu gin thr gly ser tyr cys leu pro val 
1891/631 

ATG CAT TCT GCT GAG AAA GTC CCA TTA CAG 
met his ser ala glu lys val pro leu gin 
1951/651 

AAG GGA GTA TTT AAT ATT GAA GTG CAA GCT 
lys gly val phe asn ile glu val gin ala 
2011/671 

CTG GAG AAG TTC TTC ACC CTC TGC CAC TCC 
leu glu lys phe phe thr leu cys his ser 
2071/691 

GTG CTG GAT CAG AAA ATC AGC GAG ATG GCG 
val leu asp gin lys ile ser glu met ala 
2131/711 

TGC CTG AAC TCC TCC CGC CTG GAG CCG CTC 

cys leu asn ser ser arg leu glu pro leu 
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2161/721 

GTG CTC TTC CTG CAC 
val leu phe leu his 
2221/741 

ATC GCT GGC CAG ACA 
ile ala gly gin thr 
2281/761 

AAC AGT CTG CAC AAC 
asn ser leu his asn 
2341/781 

GCT TCC TAC GTG CAC 
ala ser tyr val his 
2401/801 

GGC GCT CCC ACT GCC 
gly ala pro thr ala 
2461/821 

GCT GCT GTG AGT TCA 
ala ala val ser ser 
2521/841 

GCG GGG ACA CAC TCC 
ala gly thr his ser 
2581/861 

GAT CGC AAC TGC AGC 
asp arg asn cys ser 
2641/881 

CCT GCA GCC CCA AGG 

pro ala ala pro arg 

2701/901 

GTG GTC AGC ACC GGA 
val val ser thr gly 
2761/921 

GAC AGT TTT CGG AGG 
asp ser phe arg arg 
2821/941 

AAT GTG GTC ACC TCG 
asn val val thr ser 
2881/961 

GCG GAA AAG ATG AAC 
ala glu lys met asn 
2941/981 

CGG GGC TTT GTG TTT 
arg gly phe val phe 
3001/1001 

AAC CTT CCA ACG CTC 
asn leu pro thr leu 
3061/1021 

CAT TAC CTC AAT CTG 
his tyr leu asn leu 
3121/1041 

CCT TCC ATA TCT TCC 
pro ser ile ser ser 
3181/1061 

AGC ATG TTC GAT CTG 
ser met phe asp leu 
3241/1081 

ACA GAA CTG GCT GCT 
thr glu leu ala ala 




CTG GTG CTG GAC AAG 
leu val leu asp lys 

GCC AAC TTC TCC CAG 
ala asn phe ser gin 

AGC AAG GAC CTG AGC 
ser lys asp leu ser 

TAC GTC TTC CGC CTG 
tyr val phe arg leu 

CTC CTA GAC CCT CGG 
leu leu asp pro arg 

AAG CTG CTG CAG GCC 
lys leu leu gin ala 

GCA GCA GAC GAG GAA 
ala ala asp glu glu 

CGA ATG TCT TAC TAT 
arg met ser tyr tyr 

CCA GCC AGC AAA AAG 
pro ala ser lys lys 

ATG GTG AAA AGC ATG 
met val lys ser met 

ACT CGT TTT TCT GAC 
thr arg phe ser asp 

GAA ATT GCA GCC CTT 
giu ile ala ala leu 

ATC AGC CTG GCT TTC 
ile ser leu ala phe 

AAC CTC ATC AGA CAT 
asn leu ile arg his 

ATT TCC ATG AGG CTA 
ile ser met arg leu 

AAC CTT TTT TTT ATG 
asn leu phe phe met 

CAG AAC TCA AGC TCC 
gin asn ser ser ser 

ACT TCC GAG TAC CGC 
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GCC CTG GAT GCC GAA 
ala leu asp ala glu 
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GCT GTC AGT 


GCA 


ATT 


ala val ser 
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ile 
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cys 


AGT 
ser 


ACC 
thr 


CAA 
gin 


GTC 
val 


CTG 
leu 


CGT 
arg 


GGG 
gly 


GAA 
glu 


GGG 
gly 


GCC 
ala 


CCA 
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CTC 
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CTC 
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ATG 
met 


CAA 
gin 


GTA 
val 


ACC 
thr 


ATG 
met 


TCC 
ser 



3331/1111 

TCT CAC GAC CTG GAC 
ser his asp leu asp 
3391/1131 

TAC CTA CCT TTA GTT 
tyr leu pro leu val 
3451/1151 

GCA GAT ACT CGC AGA 
ala asp thr arg arg 
3511/1171 

ATT AAC CAG AAT GTG 
ile asn gin asn val 
3571/1191 

ATA GTG CTG TCT TCC 
ile val leu ser ser 
3631/1211 

AAC CTC ATG ATC TGC 
asn leu met ile cys 
3691/1231 

TGG ATT GCT GAC CTG 
trp ile ala asp leu 
3751/1251 

GTG TTA TGT TTT GAG 
val leu cys phe glu 
3811/1271 

CAG AAG TCA AGG GAT 
gin lys ser arg asp 
3871/1291 

AGA GGG GAG ATG ATG 
arg gly glu met met 
3931/1311 

AAT TTG AGA TGG AAG 
asn leu arg trp lys 
3991/1331 

ACA ?iAG GCC GAG TTA 
thr lys ala glu leu 
4051/1351 

TTA ATC ATC CTG GAT 
leu ile ile leu asp 
4111/1371 

GAC AGC CTG CTG GGA 
asp ser leu leu gly 
4171/1391 

ACC ACC TAC CTG ACT 
thr thr tyr leu thr 
4231/1411 

TTA CTC TTC GAA GAG 
leu leu phe glu glu 
4291/1431 

TGC AGC AGC AGC ATG 
cys ser ser ser met 
4351/1451 

AGG TTC AGT TTT GGA 
arg phe ser phe gly 
4411/1471 

CTG GCA TCT TTG GTG 
leu ala ser leu val 
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pro 


CGC 
arg 


TGT 
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GTC 
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gly 
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ATT 
ile 


TTG 
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GAT 
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thr 
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GGC 
gly 
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ala 


CTG 
leu 


GCC 
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ATA 
ile 
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ala 


TTG 
leu 


CCC 
pro 


TAT 
tyr 


AAG 
lys 


CAG 
gin 


TTC 
phe 


CTC 
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TGG 
trp 


ATC 
ile 


ATG 
met 
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TCA 
ser 


ACG 
thr 


CAG 
gin 
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leu 


TAT 
tyr 


AAG 
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gly 
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CAG 
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GTC 
val 


AAG 
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CGG 
arg 


CTG 
leu 


CGC 
arg 
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arg 


CGG 
arg 


GCT 
ala 


CCA 
pro 


AAA 

lys 


GAG 
glu 


CAG 
gin 


thr 


PAT 
his 


GAT 
asp 


CAA 
gin 


GAA 
glu 


GCC 
ala 


TTG 
leu 


ATG 
met 


CAG 
gin 


GAA 
glu 


AAC 
asn 


ATT 
ile 


GGT 
gly 


GTT 
val 


CTG 
leu 


AGG 
arg 


GTG 
val 


CAC 
his 


TGC 
cys 


TTT 
phe 
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thr 
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val 
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TGT 
cys 


GAT 
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GTC 
val 
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gly 
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GCA 
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CCA 
pro 


GAC 
asp 
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4441/1481 

TTT AAT GAA GAG CAC CTG AGA AGA TCC TTG 
phe asn glu glu his leu arg arg ser leu 
4501/1501 

ACA GCC ATG CAG ATG ACT CCT TTT CCC ACC 
thr ala met gin met thr pro phe pro thr 
4561/1521 

AGC ATC TTA TAT GAG ACA GTG AAA ATG AGG 
ser ile leu tyr asp thr val iys met arg 
4621/1541 

GAT CTC ATG TAC AGA ATT GCC AAG AGT TAC 
asp leu met tyr arg ile ala lys ser tyr 
4681/1561 

CTC CAG AAC ATG GCA GAG AAA CAC ACC AAG 
leu gin asn met ala glu iys his thr lys 
4741/1581 

CTG GTG CAC GCC GCT GCG TTA GTG GCT GAG 
leu val his ala ala ala leu val ala glu 
4801/1601 

CTG CCC GTG GGC AGT GTC AGC TTC CAG AAT 
leu pro val gly ser val ser phe gin asn 
p 4861/1621 

J| GTC TCT GAG GAC ACC CTG TCA CCT GAC GAG 
val ser glu asp thr leu ser pro asp glu 

iJ^ 4921/1641 

GAG AGT GGC CTG GTA GGC CTC CTG GAG CAG 

^'L.; glu ser gly leu val gly leu leu glu gin 

;:^J 4981/1661 

TAT GAG ACA GTT AAT GAG GTC TAC AAG CTG 
tyr glu thr val asn glu val tyr lys leu 

- 5041/1681 

TTC CGG AAG CTG ACA CTC ACT CAC AGC AAG 

rU phe arg lys leu thr leu thr his ser lys 

y. 5101/1701 

i y AAG GAT CAT AAG AGA ATG TTT GGA ACC TAC 
lys asp his lys arg met phe gly thr tyr 

5161/1721 

GGG GAT TTG GAT GAA CAG GAG TTT GTC TAC 
gly asp leu asp glu gin glu phe val tyr 
5221/1741 

ATC TCA CAT AGA CTA GAG GCA TTT TAT GGT 
ile ser his arg leu glu ala phe tyr gly 
5281/1761 

ATT AAA GAC TCC ACT CCT GTG GAC AAA ACC 
ile lys asp ser thr pro val asp lys thr 
5341/1781 

ATC ACT TTT GTG GAG CCC TAC TTT GAT GAG 

ile thr phe val glu pro tyr phe asp glu 
5401/1801 

GAG AAG AAT TTC AAC CTC CGG AGG TTC ATG 
glu lys asn phe asn leu arg arg phe met 
5461/1821 

CCT CGG GGA GAG CTG CAT GAG CAG TAC AGA 
pro arg gly glu leu his glu gin tyr arg 
5521/1841 

TTC CCC TAC ATC AAG ACC AGG ATC AGC GTC 
phe pro tyr ile lys thr arg ile ser val 




4471/1491 

AGG ACA ATT TTG GCC TAT TCA GAA GAG GAC 
arg thr ile leu ala tyr ser glu glu asp 
4531/1511 

CAG GTG GAG GAA CTT CTC TGT AAT CTG AAT 
gin val glu glu leu leu cys asn leu asn 
4591/1531 

GAA TTT CAG GAA GAT CCT GAG ATG CTT ATG 
glu phe gin glu asp pro glu met leu met 
4651/1551 

CAG GCA TCT CCT GAT CTG CGG CTG ACC TGG 
gin ala ser pro asp leu arg leu thr trp 
4711/1571 

AAG AAG TGC TAC ACG GAG GCT GCC ATG TGC 
lys lys cys tyr thr glu ala ala met cys 
4771/1591 

TAT CTG AGC ATG CTG GAG GAC CAC AGC TAC 
tyr leu ser met leu glu asp his ser tyr 
4831/1611 

ATT TCT TCC AAT GTG CTG GAG GAG TCT GTG 
ile ser ser asn val leu glu glu ser val 
4891/1631 

GAT GGG GTG TGC GCA GGC CAG TAC TTC ACC 
asp gly val cys ala gly gin tyr phe thr 
4951/1651 

GCC GCG GAG CTC TTC AGC ACG GGA GGC TTA 
ala ala glu leu phe ser thr gly gly leu 
5011/1671 

GTC ATC CCC ATC CTA GAA GCG CAT CGA GAA 
val ile pro ile leu glu ala his arg glu 
5071/1691 

CTG CAG AGA GCC TTC GAC AGC ATC GTT AAC 
leu gin arg ala phe asp ser ile val asn 
5131/1711 

TTC CGA GTT GGT TTC TTT GGA TCC AAA TTT 
phe arg val gly phe phe gly ser lys phe 
5191/1731 

AAA GAG CCT GCA ATT ACC AAG CTT CCT GAG 
lys glu pro ala ile thr lys leu pro glu 
5251/1751 

CAA TGT TTT GGT GCA GAA TTT GTG GAA GTG 
gin cys phe gly ala glu phe val glu val 
5311/1771 

AAG TTG GAT CCT AAC AAG GCC TAC ATA CAG 
lys leu asp pro asn lys ala tyr ile gin 
5371/1791 

TAT GAG ATG AAA GAC AGG GTC ACA TAC TTT 
tyr glu met lys asp arg val thr tyr phe 
5431/1811 

TAC ACC ACC CCG TTC ACC CTG GAG GGG CGG 
tyr thr thr pro phe thr leu glu gly arg 
5491/1831 

AGG AAC ACA GTC CTG ACC ACT ATG CAC GCC 
arg asn thr val leu thr thr met his ala 
5551/1851 

ATC CAG AAG GAG GAG TTT GTT TTG ACA CCG 
ile gin lys glu glu phe val leu thr pro 
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5581/1861 5611/1871 



ATT GAA GTT 


GCC 


ATT 


GAA 


GAC 


ATG 


AAG 


AAG 


AAG ACC CTG 


CAG 


TTA 
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GTT 
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ATT 
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ile glu val 
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ile 


glu 


asp 


met 


lys 


lys 


lys thr leu 


gin 


leu 


ala 


val 


ala 


ile 


asn 


5641/1881 
















5671/1891 
















CAG GAG CCG 


CCT 


GAT 


GCA 


AAG 


ATG 


CTT 


CAG 


ATG GTG CTG 


CAA 


GGC 


TCT 


GTG 


GGA 


GCT 


ACT 


gin glu pro 


pro 


asp 


ala 


lys 


met 


leu 


gin 


met val leu 


gin 


gly 


ser 


val 


gly 


ala 


thr 


5701/1901 
















5731/1911 
















GTA AAT CAG 


GGA 


CCA 


CTG 


GAA 


GTA 


GCC 


CAA 


GTG TTT TTG 


GCT 


GAA 


ATT 


CCT 


GCT 


GAT 


CCA 


val asn gin 


gly 


pro 


leu 


glu 


val 


ala 


gin 


val phe leu 


ala 


glu 


ile 


pro 


ala 


asp 


pro 


5761/1921 
















5791/1931 
















AAA CTC TAT 


CGA 


CAT 


CAC 


AAC 


AAG 


TTG 


AGG 


TTA TGC TTT 


AAG 


GAA 


TTC 


ATC 


ATG 


AGA 


TGT 


lys leu tyr 


arg 


his 


his 


asn 


lys 


leu 


arg 


leu cys phe 


lys 


glu 


phe 


ile 


met 


arg 


cys 


5821/1941 
















5851/1951 
















GGT GAA GCT 


GTA 


GAG 


AAA 


AAC 


AAG 


CGT 


CTC 


ATC ACG GCA 


GAC 


CAG 


AGG 


GAA 


TAT 


CAG 


CAG 


gly glu ala 


val 


glu 


lys 


asn 


lys 


arg 


leu 


ile thr ala 


asp gin 


arg 


glu 


tyr 


gin 


gin 


5881/1961 
















5911/1971 
















GAA CTC AAA 


AAG 


AAC 


TAT 


AAC 


AAG 


CTA 


AAA 


GAG AAC CTC 


AGG 


CCA 


ATG 


ATC 


GAG 


CGG 


A/iA 


glu leu lys 


lys 


asn 


tyr 


asn 


lys 


leu 


lys 


glu asn leu 


arg 


pro 


met 


ile 


glu 


arg 


lys 


5941/1981 
















5971/1991 
















ATT CCA GAA 


CTG 


TAC 


AAG 


CCA 


ATA 


TTC 


AGA 


GTT GAG AGT 


CAA 


AAG 


AGG 


GAC 


TCC 


TTC 


CAC 


ile pro glu 


leu 


tyr 


lys 


pro 


ile 


phe 


arg 


val glu ser 


gin 


lys 


arg 


asp 


ser 


phe 


his 


6001/2001 
















6031/2011 
















AGA TCT AGT 


TTC 


AGG 


AAA 


TGT 


GAA 


ACC 


CAG 


TTG TCA CAG 


GGC 


AGC 


TAA 










arg ser ser 


phe 


arg 


lys 


cys 


glu 


thr 


gin 


leu ser gin 


gly 


ser 


OCH 











GAAAAGCCATCTTCATTCGTGGAGACTGTGGCCCTGCAACCCTGGAGAAGGACTTGCTGGTACTTAAAAAATGGGACATT 
TGCCACCCAGGACTGACTGTACACTCCCTGATCAGCCAGCACTCTGGAAGCTTTGGGATCCCAGGAACCATGGAATTATT 
CCCAAATGGACTCTGACCAGATTTTTGCCATACTGGGGGGTGGCGGGATGGAGGATGGGTACTCAGGCATGACTGCGTAT 
TTATTAAAGTGTGTTTTTCCACAATGTACCAAACAAGGCATAAGCAGCTTCTCCTGCTGACTGGCCAATCACTGCCCATC 
TGAGAGATGATTTCCTCTGGCCCATATTTGAATTTATTGGAGTAACTCAAATTGCCTGAGGAAAAATGGAAAAATTATCC 
ACCAGTCGATTCAAACTGAATTTCACTCTTTATAGGAAGGCAGGGCAAACTTGTAGGAGTACGAAACATTTTCAATAAAT 
CTACAAAGGGAAGCCTTACTACAATTCCAAAAATCATCATGGTTGGAAATTTGGGAGGAGATTATTTGTGAACTTGTTAC 
CCTTTTGGTAATGGTGGACTJVATTGCTGTATAGTTATTTTTGTTTTATTATTACTGTTACATTAATTTAACATGCATTTA 
TAGAAGAATACATTCAAAGCACTGATGTAGGAGATACACGGTACTTGGAGCAGTCAGCCAAAAATCACAGATACTGCTTT 
CACTTAAATGGAAACAATTCTCCGATAATGCTTTGCTTTTTTTCTTATGTCACTCTTGTGTACTATCTATTTTTCTCCTC 
TCTGGGACCAAGTTTCTTTTTATAAAGCAATAATATCTCTGTTTTCATTTCAGAACATTGTGCTGTCTGTCAGCATATGT 
ATATCAGCTACAAAATATATTCAACTTTGACTTCTTTTGACAAAGGACTTTAGGAAAAGGAGGAACAAAGACATTATTTG 
AGAATTAAATTATATATTTTTAATATGACTGTGACCTTGACTGATAATAAAGATGTAATAAGAATTGCAAGCTAAAAAAA 
AAAAAAAAAAAACTCG 
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A. Allelic variations: single nucleotide changes (polymorphism) between CLASP-5 cDNA 
isoforms 



Isoform 


Nucleotide(s) 


Consequence 


1 


1727 


C to T change; mis-sense 






mutation changing codon jfrom 






alanine to valine 


2 


1749 


A to G change; silent mutation 


3 


2277 


G to C change; silent mutation 


4 


2853 


C to T change; silent mutation 


5 


3427 


A to G change; mis-sense 






mutation changing codon from 






lysine to glutamic acid 


6 


3777 


C to T change; silent mutation 



B. Alternative splices 



Isoform 



Difference 



Nucleotide(s) 



Consequence 



1 



exon deletion 



exon msertion 



1806-1944 



between 2857 and 
2858 



premature, in-frame stop codon 
leading to the production of a 
truncated, most likely soluble 
protein 

additional, in-frame 48 nucleotide 
exon that contains a stop codon at 
the second codon, which would 
lead to a truncated, most likely 
soluble protein 



These differences may be found separately or together in various combinations in the different 
human CLASP-5 isoforms 
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1'' partial exon (nucleotides 3793 to 3952) 

CCAGCTGTCAGCCAAGCTCAGTAACCTTCCAACGCTCATTTCCATGAGGCTAG 
AGTTCCTGAGAATCCTCTGTAGCCATGAGCATTACCTCAATCTGAACCTTTTT 
TTTATGAATGCTGATACTGCTCCAACATCTCCTTGTCCTTCCATATCTTCCCAG 
GTAATAAAAGAATTATTTAACTAAAAGAATTATTCAAGCTAT 

2"** exon (nucleotides 5809 to 5948) 

GCTCATAAAATGGCTCCTTACGTTTCTGTAG AACTCAAGCTCCTGCTCCAGGT 
TCCAGGACCAGAAGATCGCCAGCATGTTCGATCTGACTTCCGAGTACCGCCA 
GCAGCACTTCCTCACCGGGCTCCTCTTCACAGAACTGGCTGCTGCCCTGGATG 
CCGAAGGGGAAGG GTATGTTTCTGGrATTTA A A ATan\ AHAJGA AQC 

3"* exon (nucleotides 13662 to 13831) 

CATAACCTCTTGATTCCTGTGTTGTGCCAACAG AATCAGCAAAGTACAAAGG 

AAAGCTGTCAGTGCAATTCACAGCCTGCTAAGTTCTCACGACCTGGACCCAC 

GCTGTGTCAAACCAGAGGTGAAGGTCAAAATCGCCGCCCTTTACCTACCTTTA 

GTTGGCATCATTTTGGATGCTTTGCCACAGCTCTGTGACTTTArAG GTAATGG 

CCCTTCTGTTTTCTTTCTTGGATTG 

4*^ exon (nucleotides 16948 to 17087) 

TGTTTGACTTGACATCACAAACGATGTTTTCATTGCAG TTGCAGATACTCGCA 

GATACCGCACCAGTGGCTCGGATGAAGAACAAGAAGGAGCCGGTGCCATTA 

ACCAGAATGTGGCTCTGGCCATAGCAGGGAATAATTTCAATTTGAAAACAAG 

TGGAATAGTGCTGTCTTCCTTG GTATGTTGrTTrTrArATr.Tr.TrTonTTr,ATTTT 
TCAT 



5* exon (nucleotides 19281 to 19463) 

TGGCCTCCATCCCCCAATCTGCCTCCCTTCAG CCCTATAAGCAGTACAACATG 

CTGAACGCGGACACTACTCGCAACCTCATGATCTGCTTCCTGTGGATCATGAA 

AAATGCTGATCAGAGCCTCATTAGGAAGTGGATTGCTGACCTGCCATCAACG 

CAGCTCAACAGGATTTTAGATCTACTTTTCATCTGTGTGTTATGTTTTGAGTAT 

AAGGTAAGTCTGGAGTGGCACAACTTTATACCAGC 

6* exon (nucleotides 19829 to 19958) 

CACCAAAGGACATGTCCTCCTACCTCTGTCTTGTCCAG GGAAAACAGAGTTCT 
GACAAAGTCAGTACCCAAGTCCTGCAGAAGTCAAGGGATCtTCAAGGCCCGG 

CTGGAAGAGGCTTTGCTGCGTGGGGAAGGGGCCAGAGGGGAGATGATGCGC 
CGCCGGGCTCCAGG TGTGTTGaACTC^rrrrTTrrrTnrTrTnTQjr/^j^Qn 
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7* exon (nucleotides 20928 to 21015) 

TCAAATTCCTATCATGCATTTCTTAACTCCTA GGGAACGACCGATTTCCAGGC 
CTAAATGAAAATTTGAGATGGAAGAAAGAGrAGACACATTGGCGGCAAGCT 
AATGAGAAGCTAGATAA GTGAGTCArTrfiGr.AArTTTrTGrTArTTTTACCT 

8* exon (nucleotides 25765 to 25861) 

GCTTTAATTTGACCTCTTGTTGTTTCCTA GAACAAAGGCCGAGTTAGATCAAG 
AAGCCTTGATCAGTGGCAATCTGGCTACAGAAGCACATTTAATCATGCTGGA 
TATGCAGGAAAACATTATCCAG GTGAGGA A A Ar A A AC Arm a ATnjr^.f^YTTG 

9* exon (nucleotides 27242 to 27376) 

GGATTCAATGATGCTGTTCTTCCATTCCCCCA GGCGAGCTCGGCTCTGGACTG 
TAAAGACAGCCTGCTGGGAGGTGTTCTGAGGGTGCTGGTGAATTCTCTGAAr 
TGTGATCAGAGTACCACCTACCTGACTCACTGCTTTGCAACACTCCGTGCTCT 
CATCGCCAAGGTAAACTTGGGATGCTTGTTTTCTTCCTCTTAATT 

10* exon (nucleotides 28582 to 28734) 

AGTGATGCCTAATGGCCCTTTATGTCTCTCCTAG TTTGGAGACTTACTrTTr.G 
AAGAGGAGGTGGAACAGTGTTTCGACCTATGTCACCAAGTCCTGCACCACTG 
CAGCAGCAGCATGGATGTCACCCGGAGCCAAGCCTGTGCCACCCTTTACCTC 
CTCATGAGGTTCAGTTTTGGAGCCACrAGT GTA AGAGTTP A A ArcAnrTnAr. 
TGACCTGGAATCAG 

1 1 * exon (nucleotides 3 1 046 to 3 1 204) 



TGCAAGTAACCATGTCCCTGGCATCTTTGGTGGGAAGAGCACCAGACTTTAA 
TGAAGAGCACCTGAGAAGATCCTTGAGGACAATTTTGGCCTATTCAGAAGAG 
GACACAGCCATGCAGATGACTrrTTTTCCCACCCAG GTAnArrGAAfiTArAT 
ACCTTGTCTCATGCATGAGT 

12th exon (nucleotides 32755 to 32855) 

AGCTAAGATTATTTTGAGGCTTACACTTTTTGCA GGTGGAGGAACTTrTrTnT 

AATCTGAATAGCATCTTATATGACACAGTGAAAATGAGGGAATTTCAGGAAG 
ATCCTGAGATGCTTATGGATCTCATGTAG AG GTA AGTTTTrrTr. A r Ar Arxr- ^ 
AGGGACACCATTT 

13* exon (nucleotides 33663 to 33855) 

TCCTCAAAACTACTTCTCACTCAATCTGTCTTCA GAATTGCCAAGAGTTACCA 

GGCATCTCCTGATCTGCGGCTGArrTGGCTCCAGAACATGGCAGAGAAACAC 

ACCAAGAAGAAGTGCTACACGGAGGCTGCCATGTGCCTGGTGCACGCCGCTG 

CGTTAGTGGCTGAGTA TCTGAGCATGCTGGAGGACCACAGCTACCTGCCCGT 

GGGCAGTGTCAGCTTCCAG GTAGGGTGTOTGr A GrTTTTrrrTT AGAGCAGTG 

GTTC 

14* exon (nucleotides 38125 to 38268) 
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CTGTTCTCCAGGCTTATACTGTGGTCTCTTTCAG AATATTTCTTCCAATGTrTCT 
GGAGGAGTCTGTGGTCTCTGAGGACACCCTGTCACCTGACGAGGATGGGGTG 
TGCGCAGGCCAGTACTTCACCGAGAGTGGCCTGGTAGGCCTCCTGGAGCAGG 
CCGCGGAGCTCTTCAGCACG GTCAGTGCCrAGAGGGrATrrrGGnrTrrTarTr 

c 

15* exon (nucleotides 40166 to 40297) 

AATTCTCTCTGATGCTCTTCTCCTCTTTCCAA GGGAGGCTTATATGAGACAGT 
TAATGAGGTCTACAAGCTGGTCATCCCCATCCTAGAAGCGCATCGAGAATTC 
CGGAAGCTGACACTCACTCACAGCAAGCTGCAGAGAGCCTTCGACAGCATCG 
UAACAAGGTAGCCGGGGAGCCTGGCTGGCAGGTCTTGTTAC 

16* exon (nucleotides 40755 to 40889) 

TAAGGAGAGCTTTTTATATTTTGTTCCTCA GGATCATAAGAGAATGTTTGGAA 
CCTACTTCCGAGTTGGTTTCTTTGGATCCAAATTTGGGGATTTGGATGAACAG 
GAGTTTGTCTACAAAGAGCCTGCAATTACCAAGCTTCCTGAGATCTCACATAG 
ACTAGAGGTAAGAAAAGTGATTCTGTGCGCCTGACCTGGTACACTTTAC 

17* exon (nucleotides 42307 to 42396) 

AACCTTTATAAACTGTTGGTTCTTCTTACCTA GGCATTTTATGGTCAATGTTTT 
GGTGCAGAATTTGTGGAAGTGATTAAAGACTCrACTCCTGTGGACAAAACCA 
AGTTGGATCCTAACAAG GTATAC A A A A ATTT Ar A A A A APT A A rr a jf" A A GC 

1 8* exon (nucleotides 45250 to 45486) 

TCTTCTCCCTCCGTGCCTTTTCCCCCTTA GGCCTACATACAGATCACTTTTGTG 

GAGCCCTACTTTGATGAGTATGAGATGAAAGACAGGGTCACATACTTTGAGA 

AGAATTTCAACCTCCGGAGGTTCATGTACACCACCCCGTTCACCCTGGAGGG 

GCGGCCTCGGGGAGAGCTGCATGAGCAGTACAGAAGGAACACAGTCCTGAC 

CACTATGCACGCCTTCCCCTACATCAAGACCAGGATCAGCGTCATCCAGAAG 

GAGGAGGTAATGCACCCAAGGGATTGGCCACCACTGGATGAGT 

19* exon (nucleotides 48664 to 48807) 

ACAGTGACTTCCCTATGTTTACGTCTCATGTTCA GTTTGTTTTGACACCGATTG 
AAGTTGCCATTGAAGACATGAAGAAGAAGACCCTGCAGTTAGCAGTTGCCAT 
TAACCAGGAGCCGCCTGATGCAAAGATGCTTCAGATGGTGCTGCAAGGCTrT 
GTGGGAGCTACTGTAAATCAGG TAAGGAAAAr.rAGAGGTGnrAr^TrrT 

20* exon (nucleotides50892 to 50998) 

TATATTCTTTTTTTTTTTTTTTTTTTTTTTCCCACCAG GGACC 
CCAAGTGTTTTTGGCTGAAATTCCTGCTGATCCAAAACTCTATCGACATCACA 
ACAAGTTGAGGTTATGCTTTAAGGAATTCATCATGAG GTA AGA APtTtA A A ATO 
GCTGGGAATTTCAGTAGAG 

21^ exon (nucleotides 62398 to 62568) 

TCATTTATTTCTCCCACACTGATATTTTCATCTCA GATGTGGTGAAGCTGTAG 
AGAAAAACAAGCGTCTCATCACGGCAGACCAGAGGGAATATCAGCAGGAAC 
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TCAAAAAGAACTATAACAAGCTAAAAGAGAACCTCAGGCCAATGATCGAGC 

GGAAAATTCCAGAACTGTACAAGCCAATATTCAGAGTTGAGAGTCAAAAGAG 

GTAAGAACAGGGCAGAGGAGGCCTCTTCCTGTGGGAT 

22nd exon (nucleotides 63040 to 63294) 

CCTCCCTCTCTTTTCTTAATTTCAG GGACTCCTTCCACAGATCTAGTTTCAGGA 

AATGTGAAACCCAGTTGTCACAGGGCAGCTAAGAAAAGCCATCTTCATTCnT 

GGAGACTGTGGCCCTGCAACCCTGGAGAAGGACTTGCTGGTACTTAAAAAAT 

GGGACATTTGCCACCCAGGACTGACTGTACACTCCCTGATCAGCCAGCACTC 

TGGAAGCTTTGGGATCCCAGGAACCATGGAATTATTCCCAAATGGACTCTGA 

CCAGATTTTTGCCATACTGGGGGGTGGCGGGATGGAGGATGGGTACTCAGGC 

ATGACTGCGTATTTATTAAAGTGTGTTTTTCCACAATGTACCAAACAAGGCAT 

AAGCAGCTTCTCCTGCTGACTGGCCAATCACTGCCCATCTGAGAGATGATTTC 

CTCTGGCCCATATTTGAATTTATTGGAGTAACTCAAATTGCCTGAGGAAAAAT 

GGAAAAATTATCCACCAGTCGATTCAAACTGAATTTCACTCTTTATAGGAAG 

GCAGGGCAAACTTGTAGGAGTACGAAACATTTTCAATAAATCTACAAAGGGA 

AGCCTTACTACAATTCCAAAAATCATCATGGTTGGAAATTTGGGAGGAGATT 

ATTTGTGAACTTGTTACCCTTTTGGTAATGGTGGACTAATTGCTGTATAGTTAT 

TTTTGTTTTATTATTACTGTTACATTAATTTAACATGCATTTATAGAAGAATAC 

ATTCAAAGCACTGATGTAGGAGATACACGGTACTTGGAGCAGTCAGCCAAAA 

ATCACAGATACTGCTTTCACTTAAATGGAAACAATTCTCCGATAATGCTTTGC 

TTTTTTTCTTATGTCACTCTTGTGTACTATCTATTTTTCTCCTCTCTGGGACCAA 

GTTTCTTTTTATAAAGCAATAATATCTCTGTTTTCATTTCAGAACATTGTGCTG 

TCTGTCAGCATATGTATATCAGCTACAAAATATATTCAACTTTGACTTCTTTtG 

ACAAAGGACTTTAGGAAAAGGAGGAACAAAGACATTATTTGAGAATTAAATT 

ATATATTTTTAATATGACTGTGACCTTGACTGATAATAAAGATGTAATAAGAA 

TTGCAAGCTAAA AAAAAAAAAAAAAAAA 
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1 GTTCTCTGTGCTTAGTCTIlCTTAGTGACTTTAGOTAAGT^^ 

92 TTGAGVGATTTGAGGGCTGATCAACGAAAAGGArAGGACCOT 

183 CTVTAAGS^GGAAGCCCCTCCCAGCATGAGAACAGCC^ 

27 4 AGAGGAGGOTTAGGGOVGAAGCTTATAGOTCTGGGTGRGGCTGCTCCA 

3 65 TAGay3CTTAGGGCCCTGTATCTGCAAGGCTCTA.TCTT^ 

456 GGCACTAAATGGATGAAAATCTGCTTATATGAGCTATTTTTGTTAATGACTGGGCATGGT^ 

547 ?J3CACCAGIC(3^a^^ 

638 AAAGAAAAAGACATGACTTTTCTAAAGGAAGTCCTGTCTACAGCCATACCACCCTGAACGm 

729 AGGAAGCAGAATAATAAAAAAC^^CCTAAGGAAACATCAGAACTCCCCT^ 

820 GATCTTCTGTCCCAGGACCCACCAGAATAGAATGGCAGAGGGACACTTCACTCCTCC 

911 TAGTCTGTTTTCAAAA^TTTAAGTTGAGTGTATCAAAGRTAC 

1002 CTTCATTCCCG^CAGGGAAATAAGGCAACCCAAGGTAG^ITATCTGCCTTCCCTGC^ 

1093 TCTTTTCTCTTGCTCACTATCTTTCTCCTTAAATTCTTCCCTTTTCTTTTAGTTACC^ 

1184 CCAAT^GATCCCTAGT^CGAGGAGTCAAQ^GCCTGCATGCTGAT 

1275 AGGCATTCATTTCCTCATCTGCAAAAAAACTAGGCTGGA.TTAGATTTATCC^ 

1366 TGTTACTTAGCACCATTAGTGGCACTCAGGCCTCAGAAGGTCACTm^ 

1457 TACAGGATGAACTTCATACCTTAAGCAA.GTTGTATTCTTACAAAGTCGTCTGACT^ 

1 S4f CTAAAGATTTTGTGGCAATGGATATAACCTTACTCTTTTTTAATAGTC^ 

1 6 J9 TTACAATGTGCCACTAAGCAAACACCCATCCCTTGGGTAGGGGCATTGTGGGT 

173J) CTCCTCTTTTCAAAACATCACTCATiATTGTTTTTGAAATG^ 

ISai TGCAGTGGCACAATTTTGGCTCACTGCAAACTCCGCCTCACAGGTTCAAC^ 

1 9X^ GTGTGTGGCACCACTCCCCGCTAATTTTTGCATTTTTAGTAGAGACAGGG^ 

20iB AGTGATCTGCCTGCCTCTGCCTCCCGAAGTGCTGGGATTACAGCTGTGAGCCACCACACCCAGCCT 

2 of l3 CAAGGGA.CCTATGAAAGATACCCATAGrGGGGCCTTCTTTTAAGrGCCAATGTGCT 
2 lis ACCTGTTAATGAGTAACCTAAGTGACAGGCACATGACCAAGTTCrAATCCC 
22^6 GAGTOirACTVTCGAAAGGTTATGGGTVTCrGGTAACTGTGCTTACATAGAAG^ 

2 3=#7 TTATTTTAAGTGGATGTCTAACTATGAATTAATTCTGTAGGCAATATGTCCCAC^ 

2 41% GTTCAITTTW^TTATArTGTTTAGTCTGTAATCCCAGCACTTTGGGAGGCC^ 

25^#9 CTTGGCCAATATGGTTAAACCCCTVTCTCTACTAAAAATACAAAAAATTAGCTGGGC^^ 

2 AGGCTGAGGCAAGAGAATCGCTTGAACCTGGGAGATGAAGGTTGCAGTGAGCCGAGATCATC^ 

2i^fl CTCTGTATCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTATATATATATATATATATATATAC^^ 

2i#2 TGTGTATATATATATArGTATArATATATATATGTGTArATATATATATGTGTATATATATATATArATAGrT^ 

2913 TGAACTGAGTGAAGAGGAGGCATGTCTATCACAAGGATGATGCTTCATATTTCT 

3004 GAGCTCAAGGTGCATCAGCGCCCCTTCCTTCCCCTATTTTGCCTTTAGAGGAGCTGCCTCTGA 

3095 CTGGGGGAAGCAAGGGGTCTCTTCCAG?^CAGTCACCTTTTTCTGCTTTTC^ 

3186 CTAAAAACTCTTCTAGCTTTCTTTCTCTTTCTTCTCCATTTCCTCCTCTCTCAGTCTGG(^ 

3277 CTACTTTGCTAAACAGTTTGATGCCTTTCTCTAGGGAACTGGITCTACJ^ 

3368 TTGTASTATAAAAGTCAXCTAGTCATAITCTATTAGTTTCATACAGGC 

3459 AGAGTTTTTGTAAAATTAATTATGTTA::GGTGAAGATGTACCTCAAGAITTTCAGm 

3550 GGAAATCGGGATAAGAATCCTCATTCTGGACAGCTACTAGGCTAGAATCACTA^ 

3641 TGTAT^GCAGAATTTACCATCTGTTGCACTATCCCATTAGTTTTTM 

3732 TGCCAACCTCAACTCCACTTACCTTGTAATAAATGTTTC7OT 

3914 ACTGCTCCAACATCTCCTTGTCCTTCCATATCTTCCCAGGTAATAAAAGAATTA 

4005 TAGCTCAGTTXAATCATGTATTTCCTArAAAGGTTAGTCTTATTAATTTGAC^ 

4096 CTGAAGTTACACCACTAGCTAAG?^TTAACAATCTAAGrAATTGGT^ 

4187 GWCCAATAAGCCCTATGAAATTTAGAAACTCATAGAAAAGTCTCAA^ 

4278 TAGAGGATTCCAAAGTAAATAAGTTTTGGAAACCATGTATTCACCAGGTTTCTTTGCAGCA 

4369 TCAATTGCAAATGCACGAGAAGmGAAGAGGAAATGGGAGGAGCTGTGC^^ 

4460 CACCCGACCTCATACArrTTTTGAAATGTTACTCTGGAATAACTTT^ 
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5643 
5734 
5825 



4551 tcagtagctcjysagcotcaggcmaatttctc^ 

4 642 tctta::taattgaaaaaaaaatcttagccma^ 

4733 agatctgactcactvagcagcmttct^aaa 

4824 cttaccctaggttctttcccaccttccccttctgactgccccmgg^ 

4915 CTM^J^CACTTTPJ^Ji3:GCi:CTr 

5006 ATACTTCCCTCCCTGACATCACTTGTAGTTCOySGCCT^^^ 

5097 GCACGGTGGCTCACATTTGTAATCCC^y^ 

5188 TG?^AAACTCMATCTACTAAAAATACAAAAATGCATCA^ 

5279 TTGCTTGAACCCGGGAGAGGGAGGTTGCAGTGAGCTGAG^ 

5370 AAT^AAAAAAAAAAAAAAAAAGAAGGTACCTGAGTAAGCAGGGCCTTAAACAAAGGGQ 

5461 CCCAGGACTGGGTGAGGTTCTlATTTCTCTC?rCCMCm^^^ 

5552 TCM^TCATTTTTCAACAGACCCGGGGTGCTTTATTTTCATCTTTCC^ 

ACTCTGTGCCTCAGTTTCCTTGTCCGCCTTATCTGGCACCAGAGTACC^ 
CTCAGTAAACACTGGCCATCGCTATTTTCATTCCAGTTCTO^ 
CCAGCTTCC:7VGGACCAGAAGA.TCGCCAGCATGTTCGATC^ 

5916 ACTGGCTGCTGCCCTGGATGCCGAAGGGGAAGGGTATCTTTCTGGCATTTAAAATGGAAGATG^^^ 

6007 AAATTTGCAGTCTAGCTTCTCACACTTGGTAAAAAACTCTACTCT^ 

6 Q9| TACCCATAAGCACAAGGCACTKAGAAAGGCCTTTCTTGTGTAGAA^ 

61^ CTATTTCAGTGGAArTTACCAGTTCTGCTATAGCAGGITTCCCAAG^ 

62f|) aaaggggatagcatgcagaggcaaccai:::aa(^^ 

6311 TGCCCAGATTAGGTTTCTTGCAGACAGTGCTTCTCAGGCCAAGAG 

6 4p TGATTACTCAa^GACATGTGGCAGTATCTAGCCTAGAAGTCAACA 
65© CGTGAGCTCATGGAGTCAGAAACCCACAGCCTArCTGATTGGACT 

TATTG?\ATGCCTTCCATGGGACAGACACAGTTiy3^ 
TCAAGAGCTCACCATCCAGGGACTCAGAOTCAAAGATGATTi^^ 

6826 TACAGAATTGCAGGAACTTGCCCACTATTCCATAACACSVGAACC3\^ 

7 GCATTGCTGTATGTTATGGATGGATTTTTATCTAAATGG^^ 
7 d9f9 TTCCTGCATGAAACAAATACAGGAGCACATCGAGTTCCT 

7 TC^GTGACTCTATGTCTGCGTCTCCTGGCAGGATAGTGTGGGGAATCTCATC^ 

"^^S- gcctgagtatatcctttgagaacttca::tttccatggot 

73^ CTAGAATGTTCnACCCTACAACrAACATAAATTTCCACAGC^ 

74 63 AAGTTTGAACACAAAGAAAATTGCCTTCATGC:^^ 

7554 TAGAAACATCTTTAGTATTTACAATAGCGTAGTTTCTA^ 

7645 TTGTAAAATTGTTGCTGTTATATTACTAGCAACAAGGTAGAGTATATT^ 
TTCAAACAC^CACT^CACACACACACT^CA^ 

ACAGTGTAACTCCTTTGGCAACACAGTAATCCCTGATTGCTGGGTT 

7918 CATCATGTAAGCTAAAACCTCAGAAAMTGTCTVrGCTTC^ 

8009 AGACAGTTGGTAGACTTGCC7\ACACCTGTACATCCATGG« 

8100 CTCCAGTTTCCACTGGGTTTCCCTTGGATCTCTGACACCTGGGIA^^ 

8191 CMTTAGAAATTGTCTATCATCTTTGTGTTA^ 

8282 AACAACWW\ACTTTTG?WVTGCAATTTACAAT^ 

8373 TCTATATAACGTTGAATTCTGATGTAGCCTTATTTTGTTTGAAAG^ 

8464 •nTTAXTATTATCAT7U:TTTAAGCT 

8555 CACCCATTAACTCGTCMTTACMTAGGTATATCTCCT^^ 

8646 TTCCCACCTATGAGTGRGAACATGCGGTGTTTGGTT^ 

8737 CTACAAAGGACM'GAACTCMCCTTTTTTATGGCTG^ 

8828 TGGACATTTGGGTTGCOTCCAAGTCTITGCTATTGT^^ 

8919 ACAmrGTTTATTGCGGCAAGATGTATATTT^^ 

9010 TATTATAGTATAATTTATATAAAGAACCTGAATCCTAACT 
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9101 

9192 CTGCCTGTTGTTGAATTTCMAGTCTTTTGCTGAATGTTGTGACT 

92 8 3 CJ^TGTATGTCATATTCCTTTGTCTACTATAATTTCTCT^ 

9374 GAAAAC7\GCTGCCGTGAACATGCCTGTGCATGGTTTTGGGTGGACGTTAG^ 

94 65 TTTAATATACTGCTCTTGAATAGTTTAATAAATATGTGTACATGGTCTTAAC^^ 

9556 AGCAAACAAGTCAAAGCAGGAAGA.TGGCAGGGAATAACAAAGGTGATA 

9647 TCCGCAAGATGAAAAGCTTGTTAXTTGAAAAGAGCAACAAAATTCACCAATCTTT^^ 

9738 TTAClAAAATCATAATTGAAAG^VrTCAACT^CAA^ 

9829 TTGAAAGCCATCAAATTGATAACCTAGATTAAATGGATAAATTCCTTAAAAGGTA^ 

9920 ATCCAAATAGACCTACAGAAGTAAAAAGATTGAGTTAGTAATCAAACTTCCCAC^ 

10011 GGCTGGGCGCAGTGGCTCTVTACCTGTAATCCCAGCACTTTGGGTGGCC^ 

10102 GGCCAAGGTGATGAAACCCCGTCTCTACTAAAAATACAAAAATmGCTGGGTGTGG^ 

10193 tgaggc:agaagaatcaxttgagactggaaggc7^ggttgcagtg?vgccaag;^ 

1028 4 ACTCC7VTCACAAA.TAArAATAArAATAATAATATATTTTAAAATTTAAAACTTCCTACA^ 

10375 gaattctaccaaatatttttaaaagaattaattctaatttttta^c^^ 

104 66 ccccacttgttctatgaagctagcattaccctatactaaaccaga.caaagacatc:atgp^^ 

10557 atgtaagaccctcaacaaacactagcaaactgaatccmcagcatataa?^ 

1 OCl 8 TGCAAGATAGGCTGCATACCTGAAAATCAATTGITGTA::CM'Aarr^ 

1 Ofi 9 AGAAAAGCATTTAATJiAAATCTAATAACGCTTCCTGATAAAAACACTCT^^^ 

1 0 &I 0 A7\GGGCCTCTArGAAAAATCCACAGCTAATGTGACACTTATTAGrGAAA 

1 O^^l 1 CTACCCTTGCCACTTCTATTCAACATATTAGGAGTTCTATCTAGGGCAATTAGGCAAA 

1 vSi 2 GTGGCTCACGCCTGTAATCCCAGCACTTTGGGAGGCCJ\AAGTGC5\.CAGATCGCTT^ 

1 1 12)3 GCAAAACGCCATCTCTACAAGAAATACAAAAATTAGCTGGGCATTGGTGGCTTGTGTTTCT 

1 lip 4 AGAATTGCTTGATCCCAGAAAGCGGAGGTTGTAGTGAGCTGAGATCACGCTACTGCACTCCAGCC 

1 l"2:t 5 AAAAAAAAAAAAAAAAAGAAAAAGAAGAAAAGAAAAGAAAGCATTTAAATTGGAAAAGAAGTAAA^ 

1 1:37 6 TCTTGTTTATAGAAAACCATAAGGAATCCACAAAAAACTCCATmCAACTAATAAA 

1 i 46 7 tacaagaatcaattgtgtttctatac:acttacga.tgagcaatctgaaaatgaaat^ 

1 1;^ 8 aaaaaatatttaggaataaatgtaacaaaagaaacact^agagttatacactaaaaatga 

1 i 64 9 tgga.ggatatcatatgttcatgaatcagaagacttattattaaaatagcaatactcccc^^ 

1 life 1 aaaacaaccttgaaaaacaagaacaaaattggaggactcacj^cttccc^ 

1 il'2 2 ggttctgacatatg?^tagacatatagatcaatgaaattgggttaa.gr.gtccaa;^ 

12013 aagagtgccaaaacaattcaatgggggaaaatagaattttcaataa^ 

12104 ggaccctatattacactgtatacaaaaactaactcaaatagatcaaagacctaaatgta 

12195 tatagaggtaatcatcatag?^cttagaaaaggcagtgcnttcttag7^targa 

1228 6 ctggacttcagtaaaattaaaacttttgtgatttatagga.cacc:atc:7\aaaaaa 

12377 AAATCAAAAACCTAATAGGGGACTTGTATCTGGAATATATATTTTAAA7\TCTTACAA^ 

12468 AAAAGGCAAAAGATCAGAATAGACTOTrrCTCCAAAGA;^ 
12559 GCOTAITAAAATCAC7ATTAAATACC::ACTTGA.TACCC^ 

12 650 TGGAGrCTTCAGACACTGCTGGTGGGAATGTAAAArrGTGCAGCC^ 

12741 TGGGCGCTCGGCGGCTCACGCCTATAATCTCAGCACTTTGGGAGGCTGAGGTGt^^ 
12832 GCCAAGATGGTGAAACCCTGTCTCTACTAAAAATACAAAAATTASCCAGG^ 
12923 GAGTCAGAAGAATTGCTTGAACCCAGGAGGTGGAGGTTGCAGTGAGCCGAGATCCT 

13014 atwwaaaaaagttaaacagacagttaccatacaagccagcaaatgt;^ 
13105 atacacatactcatagct^gttggtaagtcacaatagctc^^ 

13196 ACCAMGGAAXATTArrTAGCAATAAAAAGGAATGAAGTACTGATGCTACA^ 
13287 AGGGTTAGAACACT^GGCAGTCTGTTTCTGGATTCTATCTAGT^ 

13 378 TTCTGGftTTCTAXATAGTTCTATCCTGTAITGCTTGGTTTTCCAl^^ 

134 69 CTGTArTGCTTGGTTTTCCAGGGTTAC3\ACACAGGCACTCTG^ 

135 60 ACCTCATTGCTTAGGAGCGtIutCATCACTCTTGACTGTT 
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13651 TGTGCCJ^CAGAATa^GCAAAGTACJy^AGGAAAG^ 

13742 AACCAGAGGTGAAGGTCJ^A7\ATCGCCGCCCTTTACCT;\CCTT^ 

13833 TAATGGCCCTTCTGTTTTCTTTCTTGG?\.TTGTTGGGGGCCCCTGCC7\AATGCCC 

13924 MMCTTCATATGAGraATTCTTCACT^^^ 

14015 J^CCCJVrATCTCTTTTTTGTTTAGAAA.TTATTrACTTGTTTCTO 

14106 TTTAIAAACCATACACCACAAAGGATGAAATAArAA^^ 

1 4 1 97 TCCCTTTTCTTTTATGAATATGTAAGCAAAAAAGCGGTCTCAAATGAAC^ 

14288 CAATATTAGAAAATTCTATCATAGCAGAGAAAIATTGCCCTTGGCACT^ 

14379 ATGAAAGGATAAGTTCTCTT7^GGGG?rTTTGTGAAACCCCAGAATCTAITTACAfiM 

14470 AMCCTCCCAAArAGAAGAACTTCTTATGTTGACCTGS^GAAAGGCA^ 

14561 TGTGCGCGCGTGCACACACACACACACACAATTCCTATCCCATC^ 

14652 TCCATGCTGCTCTCTTGGCAGCTGPAGGGTGCTGGGATCTGGGCAM 

14743 TTGGAACCATCCAGGTTTTCCCACCTCCTCCAAATGGAAAGTGTCATG^^ 

14834 AGTTCATGAAAATGGGACCAAACGCCTATAGTCTTTAATTCATA^^ 

14925 CTTTCTTGAAAAAAATCCTCTAGCA7\ACTGCTAATGGAGTAACTCAGAAGAGW^^ 

15 016 GGGGCCAGAGTGCTXAGGAAATGATAGAAACAAGCTCAGCACACAGGTC 

15 107 CCTGATTGCGCGGAATTGAAAATGGAGrTGTTTTTAACATTTACAGA.CM^ 

1 5 © 8 TCCTTGATGGCAAGTTTCTAAAAAGTTCCATTTCAAGAGCTTGCTTGI^ 

1 5S 9 TCTTGTCAATTAGAATATTCCCTATCAGTGACTCGGAATCAGAACTTTTC^^ 

1 5 '34 0 TGTATATGCGTTGAACTTTATGRTCATGATGTAGCTGATGRGAGAAAA^^ 

1 5-417 1 CTATTAAATATGGAATCTTTAGGTTGATAAACTCATAAGAATACAGTCTTTC^^ 

1 SilS 2 TAATATAAGTGACCTTTGAAAGGCTCAACCTTTCCGTGCCTTCCTTTCCTCTGTAAACTATAAAAATCT 

1 3 GTTTACGATAAAGGTGAAACTAGAGAACTTCTATAAAAGCATTTTGCACAGC^ 

1 ^;|;4 4 CCCTATGGCAATTAAGGGGA7yvrAAGAATGTGTCTCTATGTTAGTTGTGA.TAATGTTATC^^ 

1 5 TTAACCATTTGTTTAAAATCCCAATGGCCTTAATAAGTATTTCTAAAAAT^ 

1$^^2 6 GCACTTTGGGAGGCCAAGGTGGACAGATCACTTGAGGCCAGGGGTTCA^ 

1 4Qi 7 ATACAAAAGTTAGCCAGGCGTGGTGGCGCACCTGTAATCCCAQrrACTCGGGA^^ 

1 ^ 8 AGCGGAGATGOTGCCACTGCACTCCAGCCTGGATGATASTGTGAGACTTTG 

1 6 1.9 9 TTGTTTTCACAACTTTAGTAGAAGAAArGTTATTTACACTC^^ 

1 C29 0 TAGAAAAACATGAATTTTTAGGAAAACATGAGTGAATCAATCAGTTGTCS^ 

1 647 2 TAAATATGGGTATGTCTATATTACMAGATACAATAAATTGAC^^ 

16563 TAATAGCTTATTATTCTATATAAGGTAGTTGCTTAATTCTGTAOTTGTAGGTGTC^ 

16654 TAGAATAGAGAATGAAGTTTAAAAAATTATGTTACAGGAAACAATTATGG^^ 

16745 TTTCIAGCCTAATCTTGTGCTAGACATTGTCTCTTAGCCTGCTGTGTT^ 

16836 TGAAATTACTGTGCTGACTTTASTGACrGAGAAGTATCAGTCTCITAT^ 

16927 CAAACGATGTTTTCATTGCAGTTGCAGATACTCGCAGATACCGCACCAGrGGCTCG^ 

17018 GAATGTGGCTCTGGCCATAGCAGGGAATAATTTCAATTTGAAAAC^^ 

17109 TGGTTGATTTTTCATTTCATGTCTTCTGTCTTCTGTTTTGTTl^^ 

17200 ggagtgcagtggcgc:aa.tctcagttcactgcaacctccgcctcccgggct 

17291 GCACCACCAC7\CCTGGCTAATTTTTTTTGTATCTTTTAG^ 

17382 AGATGGTACACCCACCTCAGCCTCCCAAAGTGCTGGGATTACAGGCGTC^^ 

17473 ACTCJy3CTTAC3\GCAGGCATTGATTATTTCCGCCTCATTTTGCTGG^ 

175 64 AAGGGTTCTGGCAACAGCAAGAGATGTGCAGATCAAATTCAGG^ 

17655 AACTTGGGAAATCTGRAATCCTTGCAAAGGAGATCAAACCAAGGCTCTCTC^^ 

1774 6 A3TCATACAGTGTAATGCCTCC7\AAAAAAAAGACTG?^?^^ 

17837 GGACCCCAGGGAGCCTAAGGAGGGTTGTTTGGCTCTGTCCTTTTGC^ 

17928 TCTGATTCAGTTGAGGTGGGATTCATCTCTGTTCATTGATGAGCrC^ 

18019 ACTTTCAmTTTCAArGTAGGGTATAGATGTGC^ 

18110 TGGATGACTTTCTTGTCTTCCTTCCTGCACTTAGCAAGCTCCATCACCMGAC^^ 
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18201 AAAGS^TTTGAGATTTGACATCACCQ^CTTiCTGTTG^ 

18292 TAGCTGCTTGAGGTGGTATATGCCTTTTACATTATTTGCCTCTAAGGGAAA^ 

18383 TGTGAGGTTGTATCTC^\AGATTTCCTTTATTCTTGTCAACA^ 

18474 TCCT^AAACTAACAITTCCCTCCCATACCCACAAATTCCT 

185 65 CAAGGGTGCAGCCTGATTGTTGTTGTTTGTGTTGATGAGATGAGTAACA^^ 

18656 ATATTAAAGCAAATAGCCTGAACCCCCACACCCCAGCCCC^^ 

18747 TGGATTTGATTAAITTGCTGAGCAGTTCATGACCCTTATGTTATACTTTGC^ 

18838 TTCATGGCACCCAGCCAGTTTCCTTCTCCCCTGTAAGTCTAACCCTC^ 

18929 TGAAAGCCTCTTTGTATCACTGGGATTGCAGCACGCATGATCAAGGCCCAGGG^ 

19020 TCAGftXACGTGGCTGAAAGCCmGCTCAAATACTGCCCCTVGGTGAGCCTCTTTGCT 

19111 CTAGGGAAGGCAGGGCTACTGGCAATAGATCTCCAGCCTAGCAGTGATCTACTVGTC^^ 

19202 TGCTTGACTGTTAAGCCTCAAATTTTTCTGTTGCTTGACTTCTTCCCTG^ 

1 92 93 tac:aacatgctgaacgcggacactactcgcaacctcatg?vtctgcttcctctg^ 

19384 GGATTGCTGACCTGCCATCAACGCAGCrCAACJ^^ 

19475 GTGGCACAACTTTATACCAGCTCTTATCTCTCAATTGCAATTCTGTCTTCTTACTC^^ 

195 66 TTTTCTCATTTCTGTATTCAAATCCATAACCCATTTGmGGTATAGATAr^^ 

19657 AACCCCATTTCTGTTG?^CAGAGTTTTGGCCCATAGGATGCTC(::AGAa 

1 8 TGGftGTGTACTGTTTTGGTAACTCTCCCCATCAACGGAGATCTC^CCAAAGGACOT 
1 9p 9 GTTCTGACAAAGTCACTACCCAAGTCCTGCAGAAGTCAAGGGAT^^ 

1 0 AGGGO^GATGATGCGCCGCCGGGCTCCAGGTGTGTTGGACTGGCCCTTCCCTGCTCTCTGr 

2 dm 1 ATGTCCTCCCAACATGATTAGACACCArTACTTTCTTGAGATAm 

2 O: ifl 2 TTTGTGTTAGCCCTGTGCCTGTGAGG(WAGCGCTGTTCTACAArTGTA3GAA 

2 Oji) 3 CTCGTTGATGGTAAAAGGTCT^CCTCTAAGAATGTCAGTCATGGATGAGGTGGAAGGTGTTTCCCTC^ 

2 0T9 4 TTTGGCTAGTCTATCCGGAGCATGGTCAAGAAACTGAACAGTTCTGGCCCCAAGA^ 

2 dal 5 AACCAGTTCCCTGAAATGGCTTCAGAGrGTCCCAGTCATTCAAGC^^ 

2 047 6 AATGAAATCCCCAGGTACCCTCAGTCTTATTCACCATGCTCAAAGTAAA^ 

2 0S%7 GGAACTTGGAGGGAGCTCATAGTTTTCAGTGGTGGTCAGGCACCCTC^^ 

2 8 AGAGTTTCACAGGAATCCTCTGCTCATGAGAAATGTCTCTCTGA^ 

2 d7% 9 AGAGCGGGCCAACTTGGACATjySGTGTGaZGACTTTGTCTCCTACCAGCAACCTGCA 

20^0 CTTCTCAGTTG?VGCTTGTTATGAACTTCTGGTTATCTTGGAGGGT^ 

2 1 ACGACCGATTTCCAGGCCTAAATGAAAATTTGAGATGGPAGAAAGAGC^^ 

2 10=1 2 CACTCGGCAACTTTCTGCTACTTTTACCTAAAGTCCAAAACTATTm 

21113 GTTAGAAAATGAAACATCATTArCTGTGTAAATACAATTCATCCAGGGA^^ 

21204 TTAAAAATTTGTCACATTGATAAGCATGTAAAGAAAACT^^ 

21295 GGAGGGTAGCAGGGAAATAGAGAAAGCAGGACACGGTGCCTAGGACCGTATACTTTC^^ 

21386 ATATACATTCT^SATATATTTATGTCAGrGCTACTTAAAGTTGTT^^ 

21477 TTAAAAGTTGGATTGCCATCAAAATGrTTTAAMTCATCTTAATGA 

215 68 CTAGACACTTGGTTTTCTAGTAGTGTGTGGCTCACGCCTGTAATCCCAGCACTTTGGGAGGCT^ 

21659 GTTCQ^CTAGCCTGGCCAACATGGTGAAACCCCGTCTCTACTiA^ 

21750 AGCTACTCAGGAGGCTGAGGAGGAAGAGTCGCTTGAACCCGGGAGGCAGAGGTTGC:^^ 

218 41 GGTGACAGAGCAAGACTCCATCTCAAAAAAAAAAAAAAAAAAAG?y^CCTCA;^ 

21932 AGAAGGAAATTTACCAAAATATTAATTGCGGTTTTCTTTGGGTGCTGG^ 

22023 TTCCTACAGTAGATGTGTATTATAATATATATTGTTACCAGAAAA^^ 

22114 MCTATGCTATAGATGAATAGATATATTGACGTGTATTGCTATATATGCA^^ 

22205 TAGATAGCACATGTTTCCGATACCGTTTAGCCAC^^ 

22296 CCCTCACACACAGTTGACAAGGTAGGATACCCCATGGG^ 

22387 TTTGTAAATAAATGTGAGTTAATATCTTTATTGTTACCACAAGAM 

22478 TTTAAAACATTAAAGAAATAAGTATATTTTTATTATAGATG^ 

22569 TATTCTTGCACTAAATTCCCTGGACTGAITTATAAAAm 

22660 AAATATGCATGCCGTGGGTGCTACAGAATTGCTTAGGTGGTTTAGCTCOT 
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22751 TTTGCCTGGGCTGTGTAAACTGGTGTTTTGGAAAACCTTTCCCTTTTGCTTGCA 

22842 ACCGGITTGCTCTAGCTTAGTTCTCAGCCCTGCGCTACCTGGTGGGCTrc 

22933 GGCTTTGTGCTTTTTTTTTTTTTTTAAAAGGGAAACCATACT 

23024 AATGCTGTCTTTCACCCTCTTTCTGATCM'TTTTCGATTA^ 

23115 AGAATCACCTGGGGAGCTTTAAAACCTGTCAGTGCCCT^^ 

23206 MTTGGTAAAACTTCCCAGGTGATTCTATTGCCACCAAATCC^^ 



23388 CT^CAATCCTCCCACCTCAGCCTCCCAAGTAGOTGGGACTACAGCT 

23479 AGGGTTTCACCATGTTGCCCAGGCTGGTCTAAAACTCCTGGGCTCAAGTG^^ 

23570 ATGAGCC^^TGCGCCTGGTCAGATGGTAAAGCTTTTAAAAAACCAGATm^ 

23661 CTGAACTGTATACTTACT^TGGTTAAGATGGlAAArTTTTATCTTAGGTGTACl^ 

23752 GGACJ\CTCTAGATTTCTGCTGTCCM.TACAGTAGCCATTAGCC^^ 

23843 ATAAATCAAGArAAGTGAAAAATACGCACCAGATTTTGAAGGCTTATTCT 

23934 CTTACATGGAGACAGTATTTTTTATCTTTTAGGATAAACAAAATATATT^^ 

24 025 CACAATATATAAAATGACATATGTGGCCCCCATTGTTTCTACTAGACAGCATTGCT 
24116 AAAAGGATGATAAGGGGCCGGGCACAGTGGCTCACACCTGTAATCCCAGCACTT 

2 4207 TCGAGACCM^CCTGGCTAACGCGGCGAAACCCCGTCTCTACTAAAAATACT^AAAAAGTAGCCGGG^ 

2 42? 8 GCTACTCGGGAGGCTGAAGCAGGAGAATGGCGTGAACCCGGGAAGTGGAGCTTGCGGTGAGC 

2 4 JB 9 AGCCTGGGCGACAGAGGGAGACTCCGTCTCAAMAAAAAAAAAAAAAAAAGGATGATGAGC^ 

2 4 '45 0 tttaix:acgatataaaaatttgatggctcatgcctgtggtcccagatactc^ 

2 4jy7 1 aggcttcagtgagctatgatagtgccactgcacrctagcctgggtgatagagcaa^ 

2 4 j$6 2 TTAGATTTCATTTATTTTACACATATATTATCACTTGGAAAAT^ 

2 4SB 3 GAAAACAAGTAAAACACACAGAAGTACTTATTTTTTG^ 

2 4%A 4 CGCA7\C^CAGGTAAGTGGTAACCCTGGCCCAGGACATATGAGCTGATA^ 

2 4Sb 5 CM^CTTGGTAAAGTCAATTCTTCATACCTCCCTTTCCTTGCmCTA^ 

2 St) 2 6 TTTCMAAGAATGGATGGGCTAGCTVAAAAAAACCT^TTCTGATTCCTO 

2 ^'0.7 TCCCTAGTGGGCAATGACCAGTAATGTCCGGCAGGATATTAGATCACCTGCCCTACAGGAATA^ 

2 4 2b 8 AAAGACCCCACGCACTTGGCTGAGCAACCTCAAGGTGATCTTTGGGA^ 

2 i'Zp 9 CCTTTCCTAACCCAGCCGCATCACTGCCAAGTTCACATCACATCCAGTCT^ 

2 0 TCTAGCAATGGTTTTCATGCGTGAAATACAGCCATGGCCCTGAGGCTTTAGGCAAC^ 

2 5:|!B 1 AATATCTGCTTCTCTACCCATAGTGTTATTTTTATAATTGTCCTCTVTCATTATTAATAATAGTGGGGA^ 

2 5^5^ 2 CTAGACAGTTGTTTCGACAAGACATGAATCACAGAAGarACCTG^ 

25 663 ACTATTTTGGAGATTTCTTTACACCTTGGTGTATAGATTGCCATCATGa^A^ 
25754 TTGTTTCCTAGAACAAAGGCCGAGTTAGATCAAGAAGCCTTGATaVGTGGCAATCTGG^ 
25 845 AGGAAAACATTATCCAGGTGAGGAAAACAAACACCCAATCTGATTTGTTGGCC^ 
25936 AATTGTGAAAGACATAT^ATGTGATCCCATAGTACCTTTTTTAAAAAAATGAAGTTGAG 
26027 AATTCCAGAGATACCAAACT^TTCTTCTGGCTTCTTTGACTTAGGGCTTACTT^^ 

26118 AAATCTGTGTGAGTATGTACCAAGTTTATAATATGGATGTTGGGTTTATCGTTTAGTATCTAGAAC^ 

26209 GATGGGTCAACTCCACTTGAATGATGGTCACTGTCTGATATGGGAGCTATGATTATGACTAGGCTAa 

26300 AATGATGTCTTCTTTGGACTTAAATTTGTTAAGGAAAGTCATTTGTACCATGA^^ 

26391 actttgtaccatactaattttatcttctatgtqvttatttccaca;^ 

26482 AAAT<3\GTTCTTACATGTTTCCATTGAGAAGTCATTCATTAGACTAG^ 

26573 AGTGAT^ATCTTAATTCTCTAACTTTTGAATTGTCTAAATVTCAAAGTAAIX^ 

26664 TGACTTTAATATCTTTGATAACTTAAATGCTTGGTATCACAOT 

2 675 5 GCCCCCAAATT^ATTTGCAGTTTTCAAAGAAAGCTGAAACCTT^ 

26846 CTTCTATTTAGAAACATTGCTGCCACCAAGCAGCCCCTGTTGTA^^ 

2 6937 CTTGTATACAAAGAAAGAACTrTCCAAAAGATGTGACCCAGGATC^ 

27028 TAAATCCCAAACTCATTGGAAGCATTGAAATAAAGCCATTTGGAAA^ 

27119 caccaaactcttaagactc:accactggacatgg?^c^ 

27210 gattcaatgatgctgttcttccattcccccaggcgagctcggctctgg^ 
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27301 GA7OTCTCTGEAACTGTGATCAGAGTACCACCTACCTGACTCACTGCTTTGayV(^ 

2 73 92 TTGTTTTCTTCCTCTTAATTAAGAGTAAGATTCTCATCT 

274 83 GGCMr^\C7«JTAAAGCTCTTAiCTCTT^ 

27574 CCCATATTTGAAGTGTGGACCTAACTCTAGAACTTTAAAATGGCCATTCGCTGA^ 

27665 TTAGCAATTTCACTCTTTCTCTGTAATACCTCTGCTGAGTGAGATTAAATCCTCTATG^ 

27756 TAAAATGCCAGGAAGGTCAGAAATGAATTTCTCACGGCCTGAG(^\ATG^^ 

27847 TATTTATTTATTTATTTTTGAGACTGAGTCTCGCTCTATCGCCCAG(^ 

27938 CCTGGGCTCAAGCGATTCTCATGCCTCAGCCTCCTGAGTATTGGGATTATAG^ 

28029 TAGAGACAGGGTTTCACmTGTTGGCCAGGCTGGTCTT^^ 

28120 TACAGGCGTGAGCCCCCGTTCCTGGCCrATTTTTCCCTTTATT<3^GATCTCM 

28211 GTAAAATTCTTCTGCTCATCCTTCTCAGGACCATTTTCTCTTTCTTC^^ 

28302 CCATCATAGTTGTGATTTCACCAGTGAATGCGACCTGGCTCACAGTGCAGT^ 

28393 ATTAAATCTCAAGTCTACTCCATTGCTTAAATCCATCTTCTGATTCACATAGCTC^^ 

28484 CTTTTTGCTTGTCCAAATGGACATTTGCATATTTCAACGGTCCAGAA^ 

28575 CTCCTAGTTTGGAGACTTACTCTTTGAAGAGGAGGTGGAACAGTGTTTCGACCTATGT^ 

28666 GATGTCACCCGGAGCCT^GCCTGTGCCACCCTTTACCTCCTCATGAGGTTCAGTTTT 

28757 TGACCTGGAATCAGTAGAGAAAAATTGATGTAAAGCATCAGCTGCGAAAAAAAAT 

2 Q| 4 8 TGTGAGAAAGAAACAftlTGAGTATGTAGATAGATAGCAGCl^^ 

2 §13 9 CTCTATCAGTGTGTTTCTAAAATAGACAGCCAGGGGCCAGGAACGATGGCTTTCA^ 

2 ^^3 0 GGATCATTTGAAGTCAGGAGTTCAAGACTAGCCTGGCCAGCATGGTG^ 

2 9i2 1 GCGCATGCCTGTAATCCT^GCTACTTGGGAGGCTGAGGCAGGAGAATTGCTTGAACCTG^ 

2 §ai 2 CCATTTCACTCCAGCCrGGACAACAGAGTGAGACTTCATTTCAA?^^ 

2 §§0 3 TGTAATCCTAGCACTTCGGGAGGCCAAGGCAGGCAGATCAGATG^ 

2 9pS 4 CTCTACAATACAT^AAAATTAGCCGGGTGTGGTGGCACACACGCCCGTAATCTTAG^ 

2 ^4-8 5 CCAGGAGGCAGAGGTTGTAGTGAGCCAAGATTGTGCCACTGTATTCCAGCCTGA(^ 

2^576 GGAAGAGTATTTTAGATTAAAAGTTATCATCTGrrGGGGGAAAAAATACTAT^^ 

2|i67 TTCTGACTAGCTAGTGCCAGAATGACCTGTGGAAGAGGATTTTAAATGATCGGTGTC^^ 

2 f 15 8 TTTATTTATTGAGACAGTGTCTTGCTCTGTCACCCAGGCTGGAGTGTAGTGGCACTATCAGA^ 

2 $a4 9 TWVTGATCCTCCTACTTCAGCCTCCCATGTAGTAACTGGGATTACAGGCACGAGCCACC^ 

2 ?>g4 0 AGACAGGGTCTTGCTGCATGCCCAGGCTGGTCTAGAACACCTGAGCTCAAGTGATC 

3 q[g3 1 GGCATGAGCCATCCTGCCTAGCCAAGACTTGAGTTTTATTCAAP^ 

3 Q'l^ 2 GCTTTAGTTTGTCTAGATTTTGATACCTTCTTTGGAATTTCCATTTGTGGCC^ 

30213 TGGCCCATGGAAAAAAGTCAGCCTCCTCCAAATGTATTAGGGATGATTATTTAAAAGAC^ 

30304 TTCCATGGGCCTGTAAAGAAAGAAGAAACAAAACCTTGTTGCTTACCCGGAGTTCAAAATC 

30395 TTTATTTATTTAGAGACAGGGTCTCGCTCTGTCACCGTGGCTGGAGT 

304 8 6 TTAAGCAATTCTCCTGCCTCAGCCTCTCGAGTAGCTGGAATTACAGGTGTCCACa^ 

30577 CGGGGTTTTGCTGTGTTGCCCAGGCTGATCTTGAACTCCTGAGCTCAAGTGATCC^^ 

30668 CCAGAAGXCCCTTTCTTTTAATAAAGriTAAATAAAGTCCCAAC^^ 

3075 9 TTATAAGAATCTTCCAiSCTTGCAGCACAAAGGCAGCCCAGTCCT^ 

30850 CCATITAAACCATTTAAACAACAATATGAATGTTGTGCAAAGTGTAGCTC 

30941 AGGAAACACTGAGGAGTTGTTTGTGTCTGGCCATGCTGCTTTCAGTTATCTACTGCTAAGTGTGTC^ 

31032 TTTCMTGATGCAGAAlTTTGCTWSACTAAAGArGC^^ 

31123 GCACCTGAGAAGATCCTTGAGCS^CAATrTTGGCCTATTCAGAAGAGGAC;^^ 

31214 AGCACATACCTTGTCTCATGCOTGAGTTTGGGATCTGCCJ^ 

31305 ATAACTOTTGCAAGTATTTATTGTCCTATATGCATGTGCTCTCA^ 

31396 TCTGAATATGCTAACAGTCTATTTGGAGTGCTCAGTCTO^^ 

31487 GGTAAGGAGCTAAATGCTATGC^GGTTAGAGGAAGAGAGGTCGrGATCA^ 

31578 acactaaaggctggggtaggattccgctaggcaggggaaaagtgaggggga^ 

31 669 gggatggtaaagggaccagcttgtctggtgaggggtaaggaagggccgagtttgagagagcttgt;^ 

31760 taaggacagtcatctcttccatgtccartacgatgaccgtggaactctagcctgtccc^^ 
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31851 TATGGTAGGGTCTTAATTAAGTGAGAATGAATGAATG^ATGTTTTCT^ 

31942 CCCTCTCGGGO^GTTGCATCCTCATTCTCAGAATCTTTTTCCT^ 

32033 TCCTTTAGTTCCM'COTTTATJffAGAGGTATTCACCJ^^ 

32124 i^GAAATGACrCTAGTGGTAGAAATTTTAGGAGCCCT^^ 

32215 CTGCCAGGGTCTCTAGATCAAOrCATAGTCTCTCCCTGrrTTTO 

32306 "I^AAATGATTACAAGAGCyiTCTAAAATCTCTGAAGCCC 

32397 GCTTCCTACTTGGTATAAAAATCCCTGATATTCCAGAGTAGAGTTTAGAACTT^^ 

32488 TAAACAAAAATGTTGGAGATAGAGATAGTATGAGGATTCA^ 

32579 CTGGGCTGGCG?^GATGAAAACTACAGAGTCAGGTCTAGATGACTCCGCCATCCC^^ 

32 670 GCTTCCCCTTGCATlTCAATGTAArCCTAACTCTTCAC^^ 

32761 (^CTTCTCTCTAATCTGAATAGCATCTTATATGACACA^ 

32852 ACAGGTAAGCTTTCCTGACACACTCAAGGGACACCA^ 

32943 TAACTffGGATATAGTGATTTGGATGATAGCCAAATAATATATAGAAATT^ 

33034 GATCTCCCTAATTTAAAGTGAGGCATAATAATGTTATTAAATATAATAAm 

33125 TCAGAGAAAGAATAATTAACTGTTATAGGAATTATTTTCCATATGCT^ 

33216 C^y^ATCATTTTTTACGCGTAGTTTAAAAGTTGGTCTTCCAGTCCTO 

33307 ATCATACAAAGGGCAGTACTTTTGTCTTCTGTTTTATTTTGAGAGAAAGGAA^ 

3^^8 CATCArGTTATCAGGTATTTTTTCCCCATAAGGCTTTTACTAAGTAC^ 

33^^9 TTGGACTAATAAATGTTTTCTTCTGTCTCGTTTTCTGGAAATATAGGGC^^ 

3 iSf 0 AAAAQ^AAAGGTCACACAAAGTAGAAGAACAGTGTCAOT 

3 3 f7 1 AGAGTTACCAGGCATCTCCTGATCTGCGGCTGACCTGGCTCCAGAACATGGCA^ 

33-:^2 CATGTGCCTGGTGCACGCCGCTGCGTmGTGGCTGAGTATCTGAGCATGCT 

3 3,f^ 3 CAGGTAGGGTGTGTGCAGCTTTTCCCTTAGAGCAGTGGTTCTC^ 

33jP 4 AGATACATTTTTGGTTATaVCAACTGGGATGGGTGAGTAGGTGCTACTGG^ 

3 4m 5 AATAGGAGAGCTCCCCTGACAAAGAATTGTCTGGCCCO^AATGTC 

3 4a 2 6 TTCTTCTAATTGCTTTCCCATCGTCGTTGGGTTTTTTTTAAATTACTGTTTACJ^^ 

3 4m 1 GACATTTGCAAACACTTTCACATGCATGGCTTCAOTTG^ 

3 4'|l) 8 CACTTCGCGGATGAGAGAACTGAGAGAGCAAGTTTTCTAAGGTCACTTAA^ 

3 4=3^ 9 TTGTGAAAGTTTGGAAAACATTGGATAAATGATTTTCCTCCTGGCCCATTC^^ 

3 4=^^ 0 AATCCAAAAATCAAGAAAGAATCAAATTGACCTGAGAAGGGAGAC^^ 

3 4a 1 CATTTGGCAAAGGGAAATTGTCTGCCAGACCTAAAAGCTGGCTCG^^ 

3 4:&^ 2 TAATAGTCTTTAGAGAGAGAGAAAATAAAAAAGCACAATGTTGGGTACT^ 

34763 AACCATGAACAATATAGCTACAGTAATAGAGTGTTTTTCAAGCCAG^^ 

34 854 TTACATGGCTGATTTTATGAAAAGCTTTGTTCTTGTTATTGTTCTTC^^ 

34 945 TCAAAGTAGCTTCCCCAGGCTTAGAGAATAAGTCACTGA^ 

35036 GGGTATTGCCTCCTATGAATAACTCCTCCTTTCTTATGGTCCTCAAGAACAAAAAT^ 

35 127 TTCTTCTTAATTCAGAAATGTTTGTTATAAAAGCTGATAATTAAATCTC^ 

35218 AAGTGTACAAGAAAGTTATAAAAGTTATTTTAGATGTATTTGCTOCTTCT^ 

35309 ATTAaAGTACTACTCTTTCAATTCCCASGAAATTGTAAGGT^^ 

35400 TCCTTTTCCCM'GACTATATTITATTTTATAT^ 

35491 GTTCTAATTTTACCTAGAATTCAGTTGATTTGCTAATAATGACA^ 

35582 ATTCCGACATGGGGGCATACAGCTCTATCTGTTCACATATATTTATCC^ 

35 673 CTCCTCATGAACGTTACATTCTAACCAGAGAGACGTAATATAACTAAT^ 
35764 TAAAQVSGCACAAGATATTGTGAGCAlTTATGGTCa 

35855 GAAGTGTGAGGGCTTCTTGATGTAGAGGAGGCAATGAGTTAGGTGTTGTC^ 

35946 GAA3TCTTACITCTCTCAAATGGGACATACCAA^ 

360:i7 TAAAAATGGAATTTATCAGCCCGTCTGAGGAATGTGftGCCTGGRGTAGCAC^^ 

36128 GCTCACTGTCATCACCACTTCCCATCTCTGCCATCTGTTGTGTCA^ 

36219 TCTAGGTTCJ^CTAGGAGAGAAGAGAGATGCCTCCITTCTAGCAGTT^ 

— TCTGATCCTGTGCTCATCCCTGAACCAGCCATACTGGCCAGAGAAATG^ 
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36401 GAATaT^TGGGTGATTTCTTCCCCTAAATCAGGATGCCTTACTTTATAGAAGGAGGAAAGAATTCTA^ 

364 92 TGTTGATTTTGCAAATGCTCTC^TGCTTTAATTTTCAACCTTGTTCTGCTCC^^ 

36583 TAAATAAATATGTGCAAGGAAAATAACTTTGAGGTCACTGAATICC^ 

36674 TATTTCACAACTGTAGATACGGACACATTCTTAGATACTGCTGl^^ 

3 67 65 TTTCCTCTGACTACAGCTAATGAGGCTACATAGCTTCAAATCTGTTCCTCTAATCT 

36856 TGGGTTATTAAATGCTTTCAACTGAATTTCTTGGArCTTCTGTTGTC:ACAGA7VA(^^ 

36947 GCCTVCAAAGGCTTTGAGAGCCTCCTCCTATATTCCTAAAACTACGTTACAGTATTGC^ 

37038 TCCTGACTGATTGCTAAQSTTGATTCACATGATCTTGCTAACCAGGCCAG 

37129 TAGTAGTGGCTGACTGGAGAACTATGCTTAAGAATTTCGAGACTATGTCCAAGCTCTGGGGAAAAACT 

37220 GCCAItiATTACAGCAATAGGAAGGAGTGGCATCTGTGCCACCT^^ 

37311 TATGCATTTCTTCCATAGCTGTGAGCTAGGAAGAAAATGATTCTTGACCTGTC^ 

374 02 CACTCACCCTCAGGCTCGAGCGGGTACAAGATCAGTACTTCCATGGCCCTAA^ 

374 93 TTGGTTCACCCCAACCATGGTCTCTGCATGCTCTGCTAGAGGCTCTAAACGCAAT^^ 

3758 4 TGTTCAGGAAGAAC^W^CACACACACAGTAACTGCTGC^ 

37 675 TGTCM'GGCTGTTTATCTGAGTCAACTCCAGAGTAGCAACATACTTCAGAAAAAC^ 

37857 CTGTTGTGACATTGAGGTAAGTGTCCTTTATACAAAATCTCCTAATGGTTAAAAA^ 

3 7 $48 GCCTCTTCTAAATGAATGGCAGACAGATACTCTCGGGGTAGAArrACAGACCTAGrTTACT 

3 8 g9 TTCCTCGTTTCCCCATTCGGGGTTCCTGTGGTCTCTTACTAGTCTGGTCGCCCTGTTCTCCAGGCTTAm 

3 8 1^0 TTCTTCCAATGTGCTGGAGGAGTCTGTGGTCTCTGAGGACTVCCCTGTCACCTm 

3 8 2 |l AGTGGCCTGGTAGGCCTCCTGGAGCAGGCCGCGGAGCTCTTCAGCACGGTCAGTGCCCAG^ 

3 8 3pl2 CAGCTGGACTTGGGGTGCTGGGAACACCTGGTCTTAATGGCCCAGTCAGCCCCACTTCCCG^ 

3 8 4J3 GATGGGCCCGGGGAGGACTTTGATGTATGGAAATTGCATGTSlGCTTCCCA^ 

3 8 43a TCACTGTATGCTCATTGGTTGGGCAGCAGTTTCACAGTATTATTTCTATTTAATAG^ 

3 8 ^15 GCCCAGGGTTACACAATAAATGATGAGGCATGTTTTCACTCCCTCGTT^^ 

38 676 GAGAGAGGAGGAATACACAGACAGTGTCTTCCCTCCTAGCCACTGTGCAGTCTG^ 

3 8 747 TGGGCACAGAAACCACTGAGACTCCTCTGGTTAACGTAATCTGGATCTAAACACTCCTAGTATATATAC 

3 8 $18 AAGTCATTGAGATTCAGGGCAAAGAGGAAACACrCTTGrCTATTT^ 

38^49 AATGCAGTGGTGCAATCATGGCTCACTGCAGCCTCTACCTCCAGGGTTCAAC^^ 

3 9 0 AGGTCTGCACCACCACACTCGGCCAATTTTTTGTATTm 

3 9 13 1 CTCAAGTGATCTGCCCATCTTGGCCTCCCAAAGTGCTGGAACTACAGGTGTGAGCCM 

3 9 S3 2 CTCCTCCTTTACATTTTAAGCCAAGAAAGTATTCAGTACTTTACTATATTTAGCTGACCC^ 
39313 TTATTTTCCAGTTTTTATTTCCT^AGTTTCCTCTCTACAATTTTA^ 

394 04 CTTTTCAAGATTCAGAAAATGTCTAATATACTCTCATTTTTCCTCAAACTC^ 

394 95 ACATTTAGCATCTGGCTAGAGGAGGACCTCTGATGAAATTTAAATATACTAAAACTGCCT^ 

3958 6 CTTCTCTCCTGTTTTTTCTTTTCGTTTTGTTTTGTTT^ 

39677 GATGCAGCCTTGGCTCACTACAGCCTTGACCTCCTGGGCTCAGCCTCCC7VCCTCAACCTC 

39768 CACACCTGGCTGATTTTTTAATTTTTTTGCTVGAGATGGGGTCTCCCTATGGTG^ 

39859 CTGCCTCAGCCTCCCAAAGTGCTGGGm'ACAGGa^TGAGCCACTGCA^^ 

39950 TTATCGCCATACAGGACTACTTAGCGT^GGTGTCTAGTTC^ 

40041 GTGAAATACTGTGATACAACAATAAATTCACTCTCCAGCACATTGTTTGG^ 

40132 AAATTCTCTCTGATGCTCTTCTCCTCTTTCCAAGGGAGGCTTATATGAGACT^GTTAATC^^ 

40223 GCGOCTCGAGAATTCCGGAAGCTGACACTCACTCACAGCAAGCTGCAGAGAG^^ 

40314 CX:TGGCAGGTCTTGTTACCTGGrGGC7y33CGACCCTGTCCTAC7\GR^ 

40405 ArTGCTTTTGCTCTCACCTGTCAAACAG?W^GGGCTG?W^TTCOT 

40496 TTACTTTATAACCAGCSVAATGTGAGAAATTTTTAAGT 

4 0587 TGAAGTTTATTCCATAAGCATTAAATTTTTTTAAGGACTAAT^ 
40678 AGACCCCTGCCCTTTGCAACTCAGTGGCTCCTCAGGATGACAT^^ 
40769 GTTTGGAACCTACTTCCGAGTTGGTTTCTTTGGATCCAAATT^ 

408 60 AAGCTTCCTGAGATCTCACATAGACTAGAGGTAAGAAAAGTGATTCTGTGC^ 
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4 095 1 TCATCACCAAAAATAAACAAATAAAGJ^GTTATGGMTC^ 

41042 GAAAGGTATTAGGTGAGATCATGAGGTGATGTAATAm 

41133 TATGCTGAATATTTATTTTGAAGTCTGTGCTW^TTCJ^ 

41224 ACTAGMAGAGAGTAGrAGAGTTTTTCAACATGAAGTTTAG^ 

4 1315 M-GGTGAGGCCATCCTTTAGGTATCACTGGMGGCACCTGGGGTCOT 

414 06 GCCACTGAGTGACATTTGGTCmTAATACAACAGAA^ 

41497 CCTAGAGAAAGGTCAC^CAAAAGAAACACAGCTTA^ 

41588 C^CAGGAATTTGTTTACAAAGAGCCTGCAATTACCAAf^^ 

41679 GGCTTTCATTGCTTTGAATTTGTGCCCTCCTATG?VGTGrAAMCTTAAMCAOT 

41770 CCAAAAAAGACTAGAGGGTTTATCCTTACTTGACTCAAA^ 

41861 TG^VGAAAAAGAGCTAGCAATTGACCCCAGTCCACTTGGTTCCCTATATT^^ 

4 1 952 CTGC7\CTGTGATTTGCCACAATATGGGAAGGCTGGTGACTTCC^ 

42043 GAGATACCTCAGCAWATAATGATGTTAGCTGAATTAGAGGCC^ 

42134 TTATTTTTGTCITAGGAAATGCTGAACTTTGCTCATTAGCTCT^ 

42225 CAAACTTATTTCACTTCCAAGAAGACAAAC^ 

42316 GGTCAATGTTITGGTGCAGAAITTGTGGAAGTGATTAAAGACTCCACTCCTGTGGAC^^ 

424 07 ATTTACAAAAACTAACCATCAAGCTCTAAATCCCTTCGTTCTCTACCCAAG^ 

42498 TTAAATGCAGTCAAACCTTTTCGTCTAGAGTTCAACTACTAATTC^^ 

4 2S8 9 AGCTACCACTTATTAAGC7^CC7\ACTGTGTACCTGGAACTGCATTAGGCCCTCTACATA^ 

^ZiMO AAGTAGATTTTGTCATTCCTATTTTAGAGATGAGAAAACTGAGACATGGAAGAGTC^ 

4 2 ?! 1 GTACTCAGACTTGAGCTCTGTTGCTTCTGAACCCCTACTCTTCAGCACTG^ 

42;^2 CAAACTCGACAACACTTAGTGGCTTCCTTCTTTAGGCTGCAAGTATTCCTTCC^^ 

42=9^3 AAAAACGGCTTGGGTTTGGGTTTCCTCACTTTCTW:^ 

4 3;|4 4 TGGAGGTGGTATCAGCTCCCACAGGGTAAAGGCTCAGTCCTCCTCAAGA^ 

4 3|i|5 CCAGCCACGCTTCTGATCAGCCAGCTACAAATTTGGGAGTTTCTATAA^ 

4 6 ATAATAATAATAATAATAATTTGGGAGrrTTCTATGATACCTGTTAGATT^ 

43|17 CTGGTTACTATCAGAGTATTATGATAAAGGGTGAACATCTTTTTGTGCCTTTAm 

43:4^98 GGAAGCTCCGCCAAGTCTCAGTGTCOW^GATTTTGTTGGGGm 

4 3|#9 CCAGAGGTCATCCAGTTCCTACCCTGTAATCACAGAATTGGTCTTTTTGGTGGCCAC 

4 3i5|0 AGTCACCTCATTAGCATCACAAAGACACCCATCACTGAGGAAATTCTAACT 

4 3g 1 GACATATTCmATTATACCACAAGCTGATGCTGCCC^^ 

43:1^2 GCAAGTGCAAGGTATCATTAAGGGCCCTGGGACAGAGGACCATTC^ 

4 3 §"^3 ATCTTCCAGGATATGAAGAAGATAATGGAAGAGGGGAAGAATTGGCCAAATTGAAGAGm 

43954 ACTTAAGACACATTCCCTAGCCTCGGCTTGAAAGCAGTGGCTGTGGTAAGAGT^^ 

44045 TTGGCCGTGTGCCCACAGTGATCTGTACATAGCACAAGCTGTGTAAAA 

44136 ATTCCCAGGGTAGGTTGTGAGAGTTTGGAATGAGAGTCTGAACCCAGAGrTAC^ 

44227 GAGGTCAGTGACCCCT^TCM-CCCTTTCAGTGGGTGACTGTCCCAG^ 

44318 TAATGAGTACTTATTCGCTCTGTGTTTGAGGGTGGAGCTGATGTGAGTGrGTGT^ 

44 4 09 TTTAOTrCTATGAGTrrTGACAAACACATGTGACTACCTTCAT^^ 

44500 ATTGCTAGTCCATGrrGCAATAGTTCCTTCCTTTCCGTTGCTGft^ 

44591 CACTGGAGTGACATTTGGGTTGTTTCTAGTCTTTGTTATGAATAAAGT^ 

44 682 TTCM-TTCTCTCCTGTACACACATAGGAGTGGGGTTGCTGGGTCCA^^ 

44773 CAAACTAGCTGTGCATTTTGC^CCCACCCTCTGTGTATGAC^ 

4 4864 CTTTAGTTCGGCCATTCATGCTTCCTCATAACTGTGAGGTAGTC^ 

44955 CAAACTCTCATACCAAGGCTTCCACTGTGGATCAGTTTGCTCTGCTA^ 

45046 CCCTCrCCCTCCTTGCAGCCCCTTCTCTCCTTGGGGGTGGAGAGGAGGAACOT^ 

45137 GCACTCGGCAGGGCGGTGCCGGCACGCCGTGTTCCTGCAT^^^ 

45228 CTCCGTGCCTTTTCCCCCTTAGGCCTACATACAGATCACTTTO 

° ^^^^^'I^^^CCTCCGGAGGrTCATGTACACCACCC^^^ 

GAAGGAACACAGTCCTGACCACTATGCACGCCTTCCCCTACATCAA^^ 
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45501 GGATTGGCCACCACTGGATGACTGGGCTGGGTGGCCTCCO^^ 

45592 C^TGCACTTGAAATATGATTATATGGTTGTCCTTTC^ 

45683 TGTATATTA?\ATATTTAATATTTTAAAGTTATArCTTTAGTArAAGTG^ 

45774 TAAACTTCTACAACAGATATATTTACCTTTTAAATATTTTATATAAAAm 

458 65 TGCTCTTAATTATTGGAGACGGGACCTGCACGTGGGCASCCCCAGTGAGGCGGT^^ 

45956 TTCACTGACTASCTTCTAGTCCTAGCITCCTCCTGTGATTTTAAACT^ 

4 6047 ACCTCrAAGTA:::AGTACAAGATTATACTCATm^ 

4 6136 iU^ACACCATAAACAACTTATTATTCTTTAAAAO^^ 

4 6229 CTGTGGGTTTGCTGATCACAATTCAGCMm 

4 6320 CAGACTGAGAGCTTAAAGATCCTTGTTAAACAATTTGAGAGC^ 

4 6411 CAGTCTTGTGCACACCTAATATGACAGGAATTITTATA3<^ 

4 65 02 J^SAAATCTTTTTCTTTCCACACCCM'GGTTCATCAGTIT^ 

46593 ATCAAGTTGGTGAACAAACACAGATGAATTGTGGTGAATATATACCT^ 

4 6684 CAGCAGGTAGTGTTCjEUSCTGGGTAACCACATCAGGGTG^ 

4 6775 ctgggtttagaagccatctgctgtacttcctatacttcttggtttggacttto 

4 68 66 TCATGGTGCCCAAAAGCATTTCTWSACCTTTTCTTTTCTTCA.^^^ 

46957 TCTGTGTCTCTGTCTCTCTTTCTCTCTTTTTCATTTTTCAGACA^ 

4 7 8 CTGCAGCCTCGACCTCCTGGGCTCGAGCAATTTTCCTACCTCAGCCTCCTGTGTAGCT 

4 7 9 ^OTAAAAATACAGAGAGATATATATTTTGTAAG^GACAGGGGTCTO^ 

4 7 ?| 0 CTCCTACCTCAGCCTCCCAAGGTGCTGGGATTACAAGTGTGAGCCACTGCACCAGCCT^ 

^"^i?!^ ACCTCAACAACAGTTCTCTVGCTGTTTGCTAGGGCTGTAGAACAAAGTT^ 

47 f3i2 CAATTCTAGAGGCTAGAAGTTTGAAATCAAGGTGCCAGCACTGTTOT^ 

47^g3 TCCCTGACCTGTGGACAGCCCTCTTCTCCCTGTGTCTCITCACATCATCOT^ 

47gfl4 CACCAGTCATATTGGACTAGGGCTTACCCTGCTGGCCTCATTTTAACTT^^ 

4 7 f 1 5 CTGAGGTACTGAAGGTTAGGACTTCGACATATACATTITGGGGGRAC^^ 

47776 CXIAGAACTTAGCAAATAGATTGATTGACCCTTAAAAGAATTCCAT^ 

4 7 p^7 AAGCCTGTGTmTGATGGGGGAGAAAATCTTGAGTGCTGTGCTTCTACTAC^ 

47^8 CGGTGGCTCTTGCCTGTAATCCCAGCACTTTGGGAGGCCGAGGCAGGGGGATTG^^ 

4 8 i>4 9 AG?VGAAACCGCATCTCTACTAAAAATACAAAATTATCTGGGTGTGGTGGTGCATG^^ 

4 8 y 0 C^TTGCTTGAACCCGGGAGGCGGAGATTGCAGTGAGCCAAGAT^ 

4 8 0 1 AAATAAATAAATAAATAAATAAGTAAGTTGAGTAACTTGCTCAGTAATA^^ 

4 8 33 2 GATAGACATACTTGTC21TCGAATTATGACCTCCTCCCTTACTAC^^ 

48413 AAAATGGAGACTTATAAAGAGGTTGTTG^^GAGGAATAAATGCTAAATGTATGTAAAG^ 

4 8504 CATCTTTCTTTTTAACCTGTTAAGAGTATTTTAAATTACTCTGAAAGT^^ 

48595 TAGCTCCAGGCTTGTCCATAGCTTATG^GACCAGACAGTGACTTCCCTOT^ 

4 8 686 TTGCCATTGAAGACATGAAGAAGAAGACCCTGCAGTTASCAGTTGCCAT^ 

4 8 777 CKTAGGCTCTGTGGGAGCTACTGTAAATCT^GGTAAGCAAAACC?^ 

48868 TCTGCACCCTTCTTCCTTGGGGTTGATGAGGAOTTGATCCATAGACAAA^ 

48959 TTAGAAGACTITTAGGAAACCTCTTCCCTTTCOTGTAAC^ 

" " " TACCACOTTCTCAGAGGAGTCTCCACTTCGGGGCC^ 
CAATTCACGCAGAGTAATTGCAGGGCTTGAG^VCTCA^ 
TCAGTCAGCAG?ITGAACCCAGCATGCCAAGG?VCCTTGAC^ 
A3'CTTCCTATC7VG?^CTTCTTAGftACGCTG;^CTCCCAAGTTCAGCATGGT^^ 

4 94 1 4 TCTGTCCAGTGGGCCTCCTTAATTCAAATCTAGATCTGTTCTTGT^ 

49505 CATTCAGCGTAATTGGATTTCCCCACAAAAGTTCTGA^ 

4 9596 CC^GATCAGGAAAAAAAAATG?VGGAAACGTTTGCCTTTGTAACT 

49687 CTCTATCTGTACACGATGGATATCTCTGACTCTATCCAGCAGTT^ 

4 9778 MGAATAGATCAAGGCAATCCTGATTTTTAGAAAAGCAACTC^ 

49869 AATCTATTATTAGTAGTATTTGATTGATAAAATAATATTCGGC^ 

49960 AGGTGGGTGAATCACITGAGGTCAGGAGTTTCAGACCAGC 
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S 0 0 5 1 GTGTGGTGOIlTySGCGCTTGTAGrCCCAGTACTTGGGAGGC^^ 

50142 GATTATGCC7^CTGCGCTCC7\GCCTGACTGACA.GAGTAAGACTCTCT 

50233 CTGCGCACGCCTGTAATCCCAGCT^TTTGGGAGGCTGAGCT 

50324 AAAATCCCATCACTACAAAAAAATACAAAAATAGCCAGGTATAGTGG^ 

50415 ATC:ACTTGAGCCTGG(3^GTGGAGGCTGCAGTG3\GCAGAGATCOT 

505 06 AACAAAAACAAAACACACACACACAGGGAAAATACATTT^ 

50597 TTCCTGAGATCTCCAAGACCTTCCTTATTTTTTT^ 

50688 ATTTTCTCCCAAGCAATTTAATTTCCTTCATAaATGTATGAM 

50779 ACATATACATAXGCATATACATATATATGTATGTGTATATATATATGTGTGTGTCT 

50870 TTTTTTTTTTTTTTCCCACCAGGGACCACTGGAAGTAGCCC^ 

50961 AACAAGTTGAGGTTATGCTTTAAGGRATTCATCATGAGGTAAGAAGGAAAAT 

51052 ATCTTAGACCAGCAGCTTCAGCATCACCTGAGAACTTGCTAGAAAGAC^^ 

51143 GAGGTGTGGCTC7\GCAATGTGCAGTTTAACCCGCCCTCCAGGT(3VTCCTGATGC^^ 

51325 CAAGTATCMCAXCTTCCTCATTTTACAAAAGTSGGA^ 

51416 AGCAGAGATGAGCTCAAACCAGGTCTTCTGAATCCAAATAGTCCACATGTCCATCAATGT^ 

51507 GAGGTTCTTTGCATTGGCAGTCATTGTAATGTTACTWVrCTTATAATATCTTAT^ 

5 1598 GAGAGGCAAGTTCTAAAACCAGGCTTTTCAAAGGTATTATCCTGTGAC^^ 

5 1 6r§;9 GCAGTGGAGCTGGCTTCAAGAGCAGTCAGTTAGGAACTGGGTCTTTGGTC^^ 

5 iT^i) TCTCCTATGGTCCTAAAAATATGAGTATGAAGGATGTGATGGCAXTTCACCTATAAGAGAAAA^^ 

5 1 8 7^1 TCTTTCTATTCTW^ACCTTAGAGAAGAATATTAAGTATAAGAATATCITCACTTGGCAGM 

5 1 TTCAGTGAAT^TCATCAAAATGCTAGCATTTGTGAGGTTAGAGATATCT 

5 2 pfe CATATACAGGAGACACACTGTGCCCTGCAGTGAATCACAGCCGCTTGGTAATAGGA^ 

5 2 1=14 GTGTGTGAACATACTCACAGATTATCTTGATTAATGTTTTGATATTTGAATTGTTr^ 

5 2 3%5 TTGTTGTTGTTGTTTTTTCAGTACATAATACGTATCTTTTAAAAGGCTAGAAAAATATTl^^ 

52326 GAGACAAGGTCTCACTCTGTTGCCCAGGCTGGAGTACAGTGGCA^ 

5 2 il7 CCTCCCTGAGCCTCCCAAGTAGCTGGGACTACAGACGTGTGCCACCACArCTGGCTAAT^ 

5 2 $m ATGTTGGCCAGGCTGATCTTGAACTCCTGACCTCTWVTGATCCACCTGCCTTGGTC^ 

5 2 5 S 9 TGCCCGGCCCTATTTTTTATTTTTTGTGGAGACAGGGTCTCACTATGCCGTCTAGC^ 

5 2 Bg 0 TTAAAAT^TTTTTTTTCATATATTTTTATTTAAAATTCATTATGGTTGAATGCOT 

5 2 7§ 1 CCTATGAAATGTTTGAAGAGCACTCACrATATTCCAGGCAT^^ 

52932 GACAGGACATAGCCCTGCATGGCTGAATAATAGAATGCAGCATATGC^ 

52963 AAAGAGAAATCAGTTTCAGGCTGGAGCATTTGTGGGAGGCGTTATGGAA 

5305 4 AGGCGCTAGTGGGGAAGAGGGAGCCAGGACAGTGGCCCTGCCTGATCCAAGGTATC^ 

5314 5 TATTJ^TTACAATATGATATTCAGAGAGAGAGAGAGAGACCACATTCACATAACri^ 

5 3236 CTTACTCTATCTAATTTGTAAATTAAGCTTTATCATAGGTATGTATGTATAGGAAAAT^AAC^ 

53327 AGTTTCAGGCATCCCCTGGGGGTCTTGGAACCTATCCCCGAGAATGAGGAGGACTGAGT^ 

53418 CTGTCAGGAATGCCTGTGATGCACCGGGTTCCTCTAAGrCCAGCTGGAGCAGCGTGTTCra 

53509 GGRAATATGATAATATCAGTAAATGAGGCTCTGAATACAATTTAAATGTTTAAGAAAA 

53600 TTGCCrCAATTTTTTCTACTAGTTATAAATCTTGCAAIXrCAGTATCTCA^ 

53691 TTACAAAATCCCTACTAAACATGCGTTTCTCATTAGCTCCCA^^ 

53782 CCCAGGCAAGAGATACAAGAGGAAATGGACTTTCCTAGGGGTCGCTCCTITAACA^ 

53873 CCCCAGATGGCCTAGAGAAATGTTGGATTTTCAAAATCAAATTCACC^^ 

53964 GGAACCAGTCCCACCTTTCTCCCCACTGGCTGGGCTGCTGCATTGTTCTAGGAGC^ 

54055 CTTACACGTCACCTGAGGATTTGTCAAAATGCAGATTTTGC^ 

54146 GGAAGGACTGCTTGAGCCCTVGGAGTTCAGGACCAGCCTGGGCAACATGGC^^ 

54237 TTTAGCCAAGCGTGGTAGTGGTGTGTGCCTGTAGTCCCAGCTACTCGAGA 

54328 CTGCAATGAGCCATGATCGTGCCACTGCACTCCAGCCTGQSTGACAC^ 

54 419 TGATTCAGTAGGTCTGGGGTTGGTCCTGAAGCACTGCATTTCTAACAGGCCCCC^^ 

54510 AGTGGAGAGCCTGCAAAGATTGCTTAATTTCCTCCCTTTCACAI^ 
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54 601 GCCATCCTTATCM'CJ^TTTACCACTTTTGCCGTCTGCTTCCTCCCAAAAAATAA 

54 692 AAAAAAGS^GTTGTAAATCTCMTAAAATTATGTGAGAAC^^ 
54783 CGTCAGTTCATCCATTCATTCTySCAGTTArTG 

54874 AGTAGGAOVVGGCTACAGACAAAACAGTACTGCTGCTCTVrT^ 

54965 TACAffiGATCAGAAGTCGTGGCTGAGAGTGTGCTGATGGTGC^ 

5505 6 TGAATGTTACGGTATGTAAATCATACCCTGAA3ACAGATAGTAGATG;VGGTTCCTCT^ 

55147 TTACAGCTTTTAATGAATAGAGTAACTATTTTTTCACGTTGTTTCT 

55238 CGCAGAAGTGGAAAAOATTTGGGGCTTGAACAGCTCTCAGGTTTCCCCTTTAGTAAGC 

55329 ArAAGCCCAGATTTCC:AAGGTGA.CCATAAATGAGTCTCTGTTGCTGAG^ 

55420 GTGCCACCACGGGGCAGCGACCTrrCTGACTCAGCTGmGTGGAAGC^^ 

55511 AGTACTCATCCAGCCAGTCCTGCAACTCTTCAAACTGl^^ 

55 602 ATCTCGGGTTCAGAAGCCAAGACAAGAAGATAGAGAGATACAATCTTCTATCTTC 

55693 TTTTAAATTTTCTTCTTTATACTTATCTGGACTTTCTCATTCTTATAAAGACTAAGTGATGTTAT^;^ 

55784 AGTAGTACTCACTGGCACCAGTTACAGCTTGCCTTTAAGAGAAGTAGTTTCAGATAC^^ 

55875 TAGGCTCAGAAAACAGGCTGTTGAGTGGTGGCGTTTTTAAACTGGAGTTGG^ 

55966 CAAAGGACT^ACCAAATGTGTAGAGTGTAGGTGGAAAAAGAAGCAGTAGTTTTAACOT 

5 6057 AAT^^GGGAAATGGCTTTCCTAGGCAGTATATGTGGCGTTGGGGTTGGGAATATGGGCACTC^^ 

5 61^8 CTGCCrCAACTAGAT'GTGTGACCCTGAGAAGTTACCTACCCTCGGTTTCCCC^ 

5 6219 agttagtgacagaataaaatgagttaatacatggaacttagaataagacttcata:^^ 

5 6 330 GGGGCAAArAGGGG?^CTGATGG?VTTTGA.GTGGGAAATAGAGAATTAATCTGACTTAAATJ^ 

5 64il CTATAAAGTTTGAATCATAAGACACAGrGATGCTGATGAGACATTGGCCTGGGAGCAGC^^ 

5 6512 CGACAGGTATGrGACTGGACAGGGCACTTCACCTCTTTGC^ 

5 6 6l3 ATJySTTTATCATTCTCATATTGTACAAATAGTTCATTTACTTAGCCTGGGTC^ 

5 6 §§4 CTGTGrGCAGGGACTCTTCTGAACATTTGATATGTTTCT^ACTAArTTAOTCTTTACATT^ 

5 6735 TC7VCAG^TGA7y:;AAACTATGACATGAAGAGGTTAAGTAGCTTGI^^ 

5 6876 CACTCTGGCTCCAG?^GTCCATCCTCTTAATTGCCATGCTGAGCTGTTCCCTCTAGTGACTATATTC^ 

5 6b^7 AGCTTT^TAAGAAATTTATTTTTCTCTCACATTAAATAAGA^^ 

5 7 tfJ 8 AGACCCTGGTTCTTTTCCAATCCTTTGCCATGCCATCCTGGTTCTAGTGrACCCArTC^ 

5 7 W 9 CATCATGACCACATCTAGGCAAGTCAGGAGTAGRAATG^ 

5 7 ^'4 0 GCGATCCCTACCTGCATGGGAGGCTAGGAAGTGTAAGTTTTCAGGTGGTCAC^ 

5 7&i 1 ATTGAGAAAACAACTAACGAATGTTTGTCTGCCACACTGAGGAACCCATGTATGGGCTGTGCTGAAA^ 

5 2 TGGCTACGCCTGTAATCCCAGTACTTTGGGAGGCTGAGGTGGGCGGATCACTTGAGCTC^^ 

57513 AAACCrCGTCTTTACAAAAAATACAAAAAAAATTAACCGGGTGTAGTGG^ 

57604 ATCACTTGAGCCCAGGAGGCAGAGGTTGCAGTGAGCTGAGATCATGCCACTGC^ 

57695 aaatagagggggtac(::aagagatgcagggggggtga.gqscagcat^^ 

57786 aaaatgttactgccatcaaaagccaggaatccttttctggaggcgtaacttcctgc 

57877 ctgtgactgctagaaaaccccaggcatatttgrrctaagaaaatacttgtgttcot^ 

57968 agggaagtctggaatggttgtatcactaagtgagagcagcacagatgtttgtggacct^ 

58059 aagttgtttgcagtgtcattttatgatcttgtg?racattttccaagcgatgtgg(^ 

58150 ttaatcaaataacctasagaatataacccaaatgactgaaaggaagaaa^ 

58241 ttaaagacataaagtaatattactacataaaatctaagitttttactccagct^^ 

58332 attgctgtgtaacaaactacctcacaatttagtggcttaaaagaaaa.^ 

58423 tvtttctactcctgatacttgcctatgatgtggtcatgatggctggggccctagcg^ 

58514 cttggggttggctgggctcagctaagcagtttttgcctggagtctctcagttgcactm 
58 605 ttcctcactcatgtctggtgtctgggctgagaatactcaaacaq^ 
58696 cctgcatgacagctttagggtaactag?vcttgtcacaggatggttcagaa 
58787 ccttttattagccagtcacacagtaccacctccacca 

58878 GGGCAAGGTTCTAGGAGAGCACATTCTTGTGGCCATTTTCAGAAA.TAC^ 

58969 GCTTCTGATTTAATTCTCCCAGCAACCCTAGGAGATTArATGAGCTTATTATCCCCA 

59060 GTCATCCTGCTAATGAGTAGCAGCTTGTTTCAACCGCAGTTGTCTGCATATAAC^^ 
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59151 GAAGCTGTTAGCTAGCTGCTGCTACTTCTGCJWlCJW^SACTA^^ 

59242 aattagctgttagctttggcj«;gactcm*ccc^ 

59333 TTATTTTACACATGGTGGTAATTGTAACTGTTAGTTAAG^ 

59424 CATATGCCTTCCCTTCTAXTTCrrCCCTAAAACCGTTMTTCAIAACCTTAGC W 

59515 GATAGGTGAAAATCCAGTTAAGGGA,GTTCTTTCMiATTCT 

59606 GCTGTGGTGCTAAGTTTATTTTTGCCTAGGCAGGTAGCTCTTCCTTGC^ 

59697 GTTCACTGTTAAGAGGATTTGATAACAATCTCTGGGTATCTCCATA^ 

59788 ATATTCATCACCCACGTCTTATTCATTTGAAGGM^ 

59879 CTTTAAGGTTTATTGMTTGTAAGTTCCCTTGGGTAGAAGAGTACAAAACAAAAGG^ 

59970 TCTTTGGTGTGTACCCTACCGTGCAGGAGACAGGGAAAACACCAG^ 

60061 TTCTCTGTCTTTAATGTTCTTCAGTTTTGTCM'GA.GAAGACTGGATATAI^ 

60152 TCTA!3\AAA3TACC7^TGTTArCTCTTTAAAmTTGTCTCCCTCTC^^ 

60243 TCCTTATATCTCTATGTTTCTTAACCAATTTTTTCATATTTTTTTATTTTTTAAATCTC^^ 

60334 ACTTCTCCAATTATGTCTAGTCTATTATTTAACCTTTAACCTTTCCATTGAGTTT^ 

60425 TTTTTOTTTTTTTGGTGAGACAAGGTCTGGTTCTGTC^ 

60516 AGGCTCAAGCCATCTTCCCACCTCAGCTTCCCAAArAGCTGGGACTGCAGTC^ 

60607 GAGATGGGGTTTTGCCATGTTGCCCAGrCTAGTCTTTAACTTGTGAGCTCAGGCAAT 

6 0 SaS CAGGCATGAGGCACCACACCCASCCTAGTGACTGCAT^ 

6 0 7S9 TTTTITCATTTTCCCTGTTTATTTGTTTTTATCATGGATTTTATACCTTTTATI^ 

6 0 SBO GGATTTTATACATTTTATTACCTCTCTATATTAACTTTGGTTTTTTACATTGTTT^ 

6 0 90 1 TTGATGCATCTGTTAGTTTTCCCTCGTGGTGGTTGGTTTCTTCATATGGriTGTAATTTT^ 
6 1 H2 TCATATGCCCTGATTG^^GAATGTGTTCCTCCAGAACAACTTTATGTTGCT 

6 1 ll 3 TTAAGTTATTTTCTCACCTTGAAGCACATACAGrrCAAGGAATGTACAT^ 

6 1 el 4 TCTCAATATGACTTTCTTTTCCATAAATGGCCCTAAGCTGATAGCAAGTTTT 

6 1 © 5 ccctctttcatagatgggatagcttttgaaggctctggacgatatgcaggtggcatatcca^^ 
6142 6 tc:actaatcttgtgtgagctttgaagccccttcccctcagcccataga.cctatacaca^ 
6lb=i7 gctgccttgtcaccagctcctgtgatcattctagctttgatttttctctttctttc 

eiM&s ttttgctgtgctttcctagcacttccatatgtacgtagcaggaggaggctg 

6 1^09 9 gtgagttttttgtaagtgtaacagcttccattctgcactgtgttttg;^gtct(^^ 

6 i^ao gcaagcatcctaacatttctgagtttcagaaacaaaatagagata?^ 

6 iM 1 tgtaaaacttcttgaaacttcttctggcacatagcaagtcagaggttctc^^ 

6 1^^ 2 ttaacattagacarrtctgggccagaagcaatggcccact^ 

62 0 63 aggaggtcgaagtttgccatgagctgtgatcatgccactgcactccagcctgot 
62 15 4 aggtaaaaaaaaaaaacaaacacagaga.tttctgggtgctacccctc^ 
62245 catn:ttgaaaagctccacaggtgatcccgatatgccacccagtttga 
62336 ttctcagatttctaagcacttcaaagtcatttatttctcccacactg^ 
62427 gtctcatcacggcagaccagagggaatatcagcaggaactcatw^ 

62518 gaaaattccagaactgtacaagccaatattcagagttgagagtcaaaagag^ 
62609 grgcagcgcatggggcctagca2cttggggcatgctctgctgcacttq 

62791 gccagggtattcccagggctgaatgatggcctgrgttggttttttt 

62882 ggtcgcagacctttcacttattatttgctgagttgtcovtgactgat^ 

62973 ctaactgatcttttcttgcttctgtacgctctctttccctctcctccctctcttttcttaat^ 

63064 caggaaatgtgaaacccagttgtcacagggcagctaagaaaagcc^ 

63155 tgctggtacttaaaaaatgggacatttgccacccaggactgactgtaca^ 

63246 aaccatggaattattcccaaatggactctgaccagatttttgccat^^ 

63337 gtatttartaaagtgtgtttttccacaatgtacc:^^ 

63428 tgatttcctctggcccatatttgaatttattgcsvgtaactcaaattgcctgaggaa;^ 

63519 AATTTCACTCTTTATASGAAGGCAGGGCAAACT^ 

63610 t^aaatcatgatgcttggraatttgggaggagattatttgtgaa^ 
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63701 TGTTlTATTArrACTGTTAC^Vn'AATTTAACMG^ 

637 92 AGTCAGCCAGAAATCTXCT^GA.^ 

63883 CTATCTATTTTrCTCCTCTCTGGG?lCCAAGTTTCTTTTTAT7U\A^ 

63974 CMATCTATTVTCAGCTACAAAATM'ATTCAACTTTGACTTCT^^ 

64 0 65 TTAAATTATATATTTTTAATATGACTGTGACCTTGACTGATAAT^^ 

64156 tgctttgtgttttgttttgatc3iccta:tcgctcctaatgtot 

64247 AAATAACACTATCCAAGGTGGCACCTCTTCTGCAATGTTTAACCCTGCIAGTAATC^ 

64338 GTrrATACCATCAGGTATGCATGAATTTATAATCTGAAAG?VGG3^ 

64429 TTTTTTAGGTArTTGGGGAAGAACTCTTTTTAAAGTATACACCTAACTGCTT^ 

64520 tattcccctactctttgggagacactgtgttgagaccaaggtcaaa;^^ 

64 611 ATCATATACCTGTGTAGTAGCCACAGTACAAAACAGACTAGSVACAC^^ 
64702 AACACCTATGGTATTA3ATTCTGCCCTAAAACAATAAG?^GTTAGATGCTAAGT 
64793 CCCTAGTAACCTAGAATATTCCTGATTAAATATCCCCTGCTTTlAGATACrCTGTTGTCCA^ 
64884 CCACAGTGGATACATTTGCTTCATGAGTGCAGGAACCATGTTCA^TGCTGC^ 

64975 CCAATAAATATTTGTTGATTCACTAAArGAATGAArGATGA.GTAGGCCTGCTTC^ 

65066 CATATATAATTTTAAAAATTCTAGTAGCCATATTAAAAATAArAATAGGCCAAGTGC^ 

65157 GACCAAGGTGGGCAGATCACTTGAGCCCAGGAGrTTGAGACCAGCCTGGGCA;^^ 

6524 8 AATTAACCAAGTGTGCTGGCATGTGCCTGTAGTCCCAGCTACTCGGGAGGCTAAGGTGGC5VGGAT 

6 9 CAGTGAGCCATG?VTCGTGTCACTGCACTCTAGCCTGGGTGACAGAGTGAGACCCTG^ 

65 j|0 GCAAA7OTAACTTTACTAGTATAITTAACCCAATATATATAAAATATTATO 
655^1 TATTTCCM'ATTAAGTCTTTAAAAATCTGATGTGTAGTTTGTACTTA^^ 
65^jL2 GCCACTVGGGGGCCACTGGCTACCATATTGGATASTGCCATTCTAGAAG^ 
6^'7|)3 AATACAGATAACCAAAGAAGTGGGACTAGTGTCTGAAGTAAGAATGACAGGGTAT 
6 Sf$ 4 TTGTGGGGTTGCAGAATAAGGATTTGTCAATATTGGCTCTAGCTGTTC^^ 

6 sip 5 ATAGTTAAGTGGGGAGCATGGCTGCACTTTTTT^GTGM'GGCACAAAAAAAC^^ 

6 ^,9 7 6 TGCAGTTAGAAAAGAGGCCTTrrAAGAATCCCTAAGAGTAAAGCAAATTAGTATCTTTGTTTCCTGAAAAT^ 

6 4GK67 TGATAGCCCTCTTCATTTTATTTGGAAAACTCTTCTATGAAAGCriT 

6 61j5 8 CT^GGTCAAGGGTGCACTTGTCACTATCACATAAGAATCTC^^ 

6 9 CCACACATTGTTTTAAATTCCArAATTCTATTCTATAAAGAGTGGTTTC^ 

6 §M 0 TTAGAGTTGTCATTGGrAATTGTGGTTGCAAGTATGCTTTCAAAGA.CCAGAAG^ 

6 1 TTTTTGATACGGRGTCTCACTCTGTTGCCCAGGCTGGAGTGCATTGGCACCATCTm 

6 6^12 2 ATTCTCCTGCCTCAGCCTCCTGAGTAGCTGGGATTACAGGCGTCCACCACCACGCCTGACTA^ 

66613 TTACCATGTTGGCCAAGCTGGTCTCAAATTCCTGACCTCAGGTGATCCACCTGCCTC^ 
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Putative promoter sequence of human CLASP-5 



GGAACAATTTCCTCTCATGTGTATGGCTCCCTAAAGTGTTGGCTGAGCATTGTCCACATGGGTG 
ATGCAAAGGATCACTGAACTAGGAGCAGTTGGGAAAAAATACAATCATTGGGAATTCCTGTAGC 
ATCGAATGTGCCTACAGGGAGGTAGAAGTATTCATACAACAGTTCTCTGGTGTTCTCTGTTGTA 
GCAACCAGTCAGCCAAAAGGGTTCAGCTGCTTGAAATGAGAATGGCTGGATCAAAATGGCAGCT 
CATGATTTAAAGGATTCTAGTCAGATACCAGACATCCTCACATAGAGAAAACTCTGAATGGCTG 
GGGGAGAAGGAGTCAAATGCCCTGGATCTTTTTCTTGGGCCTCAAAGTCCTCCTTCTGTCATCA 
TCCTTCCAGTATTGGGCAGGACCTGACTGCAGGCATCATGGCCTCTGTGAACTTCTCAAGGGTA 
TGTATTATCTGACAAAAACTACGATGTCCACTAACAGGCCACTGAAAGGTATCTTAGTCAGTTC 
TGCTCATTGCCCAGCCAAGGCCTACGTTTTATAACATGATATCAAAGATTGCATCTAAAATTGT 
GATGATTTCCTAAAATAATCATTTCATTTAGATTTTTCTATTTTAATCCAAGGTATTCTTCAGC 
GGAAATAAGGAAACAGTTTACTCTCCCACCAAACCTTGGCCAGTACCATCGACAGAGCATAAGT 
ACCTCTGGCTTCCCCTCTCTTCAACTAGTAAGTATGAGTTCCAGGTTTACTTAGCGATTGGTCA 
AGTGCAAAAGTGCCCAGGGTATGTGTTTGCCTCCTGTTCCTTAGATCTTCCTACCATCACCTCA 
□ CATTCTCCAGTCACCAGATCCTAACTCTGTGACTGTGTCTGGACATCAGACAATATCCCTCTCT 
CTCTCTGCCAACCGGTACTTAGGGTACATAATAGAACCTCTGGGAGCTGTGGTTTTGATGTCTC 
TAGACTAGGTGGGCTTCCAGGTGACTCAGTCTCATCCAAATTATGGTTCATATTTGGGGGAGAA 
GGGCTAGCCCAAAAACTTACCACCATTTGTAGTATGCATTTTTTTGGAAAAGCATATTCCAAAA 
TCTGAAATGCCAAGTTACAGACCTCCTTTTTGTAAAATAATTTTCTTGCTAGTATAATTTACAT 
r| ATAATAAAATTCACACATTTTAGGTGTACAATTTGGTGAACTTGGGCAACTTAGAGTCACTTAA 
Q CCTTTCCTCAGTCAAGATATAGAACACTTCTTTTATCCT7\AAGCGTTCCCCAGCGCGCTTTTAC 
AATCTCCTCTCCCCAGGCCACACCCTCC7U\CTCACGCAATCTCTGACTCACTTCTGTCACCATA 
ATTTTGCTCTATCTGGAGCTTCATATCCTGTTACAGTATGTACAAACCTTCTTTTTTTGAGACA 
111 GGGTGTCAGTCTGTCACCCAGCCTGGAGTACAGAGGTGTGATCTCAGCTCACTGCAACCTCAAC 
h^" CTCCCAGGATCAGATGATTCTCCTCCCACCTCATCCTCCCAAGTAGCCGGGACTACAGGCGCAT 
W GCCACCACACCTGGCTAATTTTTGTACTTTTTGTAGAGACAGGGGTCTCGCTATGTTGCCCAGG 
CTGGTCTTGAACTCCTGGGCTCAAGCGATCCTCCTGCCTCAGCCTCCCAAAGTGCTGGGATTAC 
^=3 AGTGAGCCACTGCACCTGGCCCTAAACCTTCATTTTTAAAACACATTTCCTCTTAAATTGAAGA 
TTGCCTACATTTTTATATCAATGCCAATTGTTGAGTGTGCCTATATGTGTTATATTATTTGAGC 
ACTAAATGCCAGATGTGTGCCAAGTGAGATAAATCTGACAAATGAGATGGTTTGTAAAACCAGC 
AGTGAATATTCACTTCCTCTGTGAGAGAGCTCCAGCCCTCCTGTACTCACTTCCTCACACAGCA 
CAGCAGCACTCTTGCTGGTTCTGCTGCTTATCTTGAAGAGGTTAGGTTACTTTTTGTTTCTACT 
TATTACTTCGAAACCACTTCTGCCTTAGAAATTTTGTAACCTTCCGCTCAGTTTC CGGTAACCG 
CCATTTTGTCTCCTGTAACT^TTTACGCGCCGTGTAACTGTGAATCTTT 
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hCIASP4 
hCIASPS 
hCLASP3 
hCLASP2 
hCLASP? 
hCLASPl 



MFPMEDISISVIGRQRRTVQ 20 

MTHLNSLDVQLAQELG 16 

MAERRAFAQKISRTVAAEVRKQISGQYSGSPQLLKNLNIVG 41 

MLLFPYDDFQTAILRRQGRYICS 23 

MAASERRAFAHKINRTVAAEVRKQVSRERSGSPHSSRRCSSSL 4 3 

MSFRGKVFKREPSEFWKKRRTVRRVIQEEFHRFSSQEKPRLLEPLDYETVIEELEKTYRN 60 



hCLASP4 STVPEDAEKRAQSLFVKECIKTYSTDWHWNYK 53 

hCLASPS DPT 19 

hCLASPS N ISHHTTVPLTEAVDPVDLEDYLITHPLAVDSGPLRDLIEFP 83 

hCLASP2 TVPAKAEEEAQSLFVTECIKTYNSDWHLVNYK 55 

hCLASP? G VPLTEWEPLDFEDVLLSRPPDAEPGPLRDLVEFP 79 



hCLASPl DPLQDLLFFPSDDFSAATVSWDIRTLYSTVPEDAEHKAENLLVKEACKFYSSQWHWNYK 120 



hCLASP4 YEDFSGDFRMLPCKSLRPEKIPNHVFEIDEDCEKDED SSSLCSQKGGVIKQG 105 

hCLASPS DDDLDWFTPKECRTLQP-SLPEEGVELDPHVR DCVQTYIREWLI 63 

hCLASPS PDDIEWYSPRDCRTLVS-AVPEE-SEMDPHVR DCIRSYTEDWAI 126 

hqLASP2 YEDYSGEFRQLPNKWKLDKLPVHVYEVDEEVDKDED AASLGSQKGGITKHG 107 

hC5LASP7 ADDLELLLQPRECRTTEP-GIPKD-EKLDAQVR AAVEMYIEDWVI 122 



h|^LASPl YEQYSGDIRQLPRAEYKPEKLPSHSFEIDHEDADKDEDTTSHSSSKGGGGAGGTGVFKSG 180 



HCLASP4 WLHKANVNSTIT—VTMKVFKRRYFYLTQLPDGSYILNSYKDEKNSKESK-GCIYLDACI 162 

HgLasps VNRKNQGSPEIC—GFKKTGSRKDFHKT-LPKQTFESETLECSEPAAQA—GPRHLNVLC 118 

hllLASP3 VIRKYHKLGTGF— NPNTLDKQKERQKG-LPKQVFESDEAPDGNSYQDDQDDLKRRSMSI 183 

hSoAS P2 WL YKGNMNS AI S - - VTMRS FKRRFFHL I QLGDGS YNLN FYKDEK I SKEPK-GS I FLDSCM 164 

hCLASP7 VHRRYQYLSAAY—SPVTTDTQRERQKG-LPRQVFEQDASGDERSGPEDSNDSRRGSGSP 179 

hSLAS P 1 WLYKGNFNS TVNNTVTVRS FKKRYFQLTQLPDNS YIMN FYKDEK I SKEPK-GC I FLDSCT 239 

HciASP4 DWQCPKMRRHAFELKMLDKYSHYLAAETEQEMEEWLITLKKIIQINTDSLVQEKKETVE 222 

rigLASPS DVSGKGPVTACDFDLRSLQPDKRLENLLQQVSAEDFEKQNEEARRTN RQAE 169 

hgLASP3 DDTPRGSWACSIFDLKNSLPDALLPNLLDRTPNEEIDRQNDDQRKSN RHKE 234 

mLASPl GWQNNKVRRFAFELKMQDKSSYLLAADSEVEMEEWITILNKILQLN FEAAMQEK 219 

hCLASP7 EDTPRSSGASSIFDLRNLAADSLLPSLLERAAPEDVDRRNETLRRQH RPPA 230 

hCLASPl GVVQNNRLRKYAFELKMNDLTYFVLAAETESDMDEWIHTLNRILQISPEGPLQGRRSTEL 299 



hCLASP4 TAQDDETSS QGKAENIMASLERSMHPELMKYGRETEQLNKLSRGDGRQNLFSFDSE 278 



hCLASPS LFALYPSVD EEDAVEIRPVPECPKEHLG N RILVKLLTLKFEIE 212 

hCLASP3 LFALHPSPD EEEPIERLSVPDIPKEHFG QRLLVKCLSLKFEIE 277 

hCLASP2 RNGDSHEDD EQSKLEGSGSGLDSYLPELAKSAREAEIK LKSESRVKLFYLDPD 272 

hCLASP7 LLTLYPAPD EDEAVERCSRPEPPREHFG QRILVKCLSLKFEIE 273 



hCLASPl TDLGLDSLDNSVTCECTPEETDSSENNLHADFAKYLTETEDTVKTTRNMERLNLFSLDPD 359 



hCLASP4 
hCLASP5 
hCLASP3 
hCLASP2 
hCLASP7 
hCLASPl 



VQRLDFS GIEPDIKP-FEEKCNKRFLVNCHDLTFNILGQIGDNAKGPPTNVEPFFI 333 

lEPLFAS IALYDVKERKKISENFHCDLNSDQFKGFLRAHTPSVAASSQARSAVFSV 2 68 

lEPI FAS LALYDVKEKKKI SENFYFDLNSE®1KGLLRPHVPPAAI TTLARSAI FS I 333 

AQKLDFS SAEPEVKS-FEEKFGKRILVKCNDLSFNLQCCVAENEEGPTTNVEPFFV 327 

IEPI FG I LALYDVREKKKI SENFYFDLNS DSMKGLLRAHGTHPAI STLARSAI FSV 329 

IDTLKLQKKDLLEPESVIKP FEEKAAKRIMI I CKALNSNLQGCVTENENDPI TNI EPFFV 419 
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hCIASP4 
hCLASPS 
hCLASP3 
hCLASP2 
hCLASP7 
hCLASPl 



hCLASP4 
hCLASPS 
hCLASPS 
hCLASP2 
hCLASP? 
hCLASPl 



FKSHLESTIYTQDLHVHKFFHHCQLIQS - 

IEVQAVSSVHTQDNHLEKFFTLCHSLES3VTFPIRVLDQKISEMALEHELKLSIICLNSS 
VEWAVSS IHTQDPYLDKFFALVNALDE H- 
ISTHLVSTVYTQDQHLHNFFQYCQKTES - 
VELTAVSSVHPQDPYLDKFFTLVHVLEE 3- 
VSTFWSTVNTQDPHVNAFFQECQKREK D- 



GSKEVPGELIKYLKCLHAM 794 

715 

LFPVRIGDMRIMENNLENELKSSISALNSS 780 

GAQALGNELVKYLKSLHAM 787 

AFPFRLKDTVLSEGNVEQELRASLAALRLA 767 
MSQSPTSNFIRSCKNLLNVE 887 



EIQVMIQFLPVILMQLFR VLTNMTH EDDVP 824 

RL E PLVLFLHLVLDKLE QLSVQPMVIAGQTANFSQFAFES WAIANSLHNSKDLSKDQHG 775 
QL EPWRFLHLLLDKLI LLVIRPPVIAGQI VNLGQASFEAMAS I INRLHKNLEGNHDQHG 840 
EGHVMIAFLPTILNQLER VLT-RAT QEEVA 816 

SPEPLVAFSHHVLDKLVRLVIRPPIISGQIVNLGRGAFEAMAHVVSLVHRSLEAAQDARG 827 
KI HAIMSFLPI ILNQLE K-------- VLVQNE EDEIT 916 



INCTMV-LLHIVSKCHEEGLDS YLRSFIKYS FRPEKP 860 

RNCLLASYVHYVFRLPEVQRDVPKSGAPTALLDPRSYHTYGRTSAAAVSSKLLQARYMSS 835 
RNSLLASYIHYVFRLPNTYPNSSSPG-PGGLGGSVHYATMARSAVRPASLNLNRSRSLSN 899 

VNVTRV-IIHWAQCHEEGLES HLRSYVKYA YKAEPY 852 

HCPQLAAYVHYAFRLPGTEPSLPDGAPP VTVQAATLARGSGRPASLYLARSKSXSS 883 

TTVTRV-LPDIVAKCHEEQLDH SVQSYIKFV FKTRAC 952 



hCfi7VSP4 
hbiASPS 
haiASP3 

h(iLASP7 
heLASPl 



SAPQAQLIH ETLATTMIAILKQS 883 

SNPDLAGTHSAADEEVKNIMSSKIADRNCSRMSYYCSGSSDAPSSPA 882 

SNPDISGTPTSPDDEVRSIIGSKGLDRSNSWVNTGGPKAAPWGSNPSPSAESTQAMDRSC 959 

VASEYKTVH EELTKSMTTILKPS 875 

SNPDLAVAPGSVDDEVSRILASKLLHEELA-LQ 915 

KE RPVH EDLAKNVTGLLKSN 972 



hCLASP4 
hplASPS 
hCLASP3 
hBlASP2 
hittLASP7 
hClASPl 



hCLASP4 
hCLASPS 
hCLASP3 
hCLASP2 
hCLASP7 
hCLASPl 



hCLASP4 
hCLASPS 
hCLASP3 
hCLASP2 
hCLASP7 
hCLASPl 



ADFLSINKLLKYS WFFFEIIAKSM 907 

APRPASKKHFHEEUU^ MWSTGMVKSM 910 

NRMSSHTETSSFLQTLTGRLPTKKLFHEEIJaQWWCSGSVRESALQQAWFFFELt^ 1019 

ADFLTSNKLLRYS WFFFDVLIKSM 899 

WWSSSAVREAILQHA WFFFQLMVKSM 94 2 

DSPTVKHVLKHS WFFFAIILKSM 995 

. * : *** 

Cadher in Cle avage 
AT YLLEENKI KLmGQRTPET YHHVLHSLLLAI I PHVT I RYAE I PDE- - - SRNVldsLAS 964 
AQHVHNMDKRDS I RRTR FSDRFMDDI TT IVNWTSE I AALLVKPQKENEQAEKMN3 SLAF 970 
VHHLYFNDKLEAI RKSR ?*PERFMDDIAALVSTIASDIVSRFQKDTEM~VERLN1 SLAF 1076 

AOHLIENSKVKLI RNQR rPASYHHAAETWNMLMPHITQKFGDNPEA SKNANK SLAV 956 

ALHLLLGQRLDTE RKLR fPGRFLDDITALVGSVGLEVITRVHKDVEL AEHLKf SLAF 999 

AQHLIDTNKIQLE RPQR rPESYQNELDNLVMVLSDHVIWKYKDALEE—TRRATE SVAR 1052 



FLKRCLTLMDRGFIF f^LINDYISGFSPKDP ] 



FLYDLLSLMDRGFVF l^LIRHYCSQLSAKLSNL E 



-i^ VLAEYKFEFLQTICNHEHYIPLNL 
TLISMRLEFLRILCSHEHYLNLNL 
FLNDLLSVMDRGFVF SLIKSCYKQVSSKLYSLPNPS \n.VSLRLDFLRIICSHEHYVTLNL 

FIKRCFTEWDRGFVF KQINNYISCFAPGDP F TLFEYKFEFLRWCNHEHYIPLNL 

FLSDLLSLVDRGFVF SLVRAHYKQVATRLQSSPNP^ ALLTLRMEFTRILCSHEHYVTLNL 
FLKRCFTEMDRGCVF KMVNNYISMFSSGDL » TLCQYKFDFLQEVCQHEHFIPLCL 



1019 
1027 
1136 
1011 
1059 
1107 
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hCLASP4 
hCLASPS 
hCLASP3 
hCLASP2 
hCLASP7 
hCLASPl 



hCLASP4 
hCLASP5 
hCLASP3 
hCLASP2 
hCLASP7 
hCLASPl 



Cadherin EC m otif 

PMAFAKPKLQR VQDS—KLEYSLSDEYCKHHFLVGljLLREjTS I 1060 

FFMNADTAPTSP—CPSISSQNSSSCSSFQDQKIASMFDLTSEYRQQHFLTGI LFTE LAA 1085 
PCSLLTPPASPSPSVSSATSQSSGFSTNVQDQKIANMFELSVPFRQQHYLAGI VLTE LAV 1196 

PMPFGKGRIQR YQDL— QLDYSLTDEFCKNHFLVGI LLRE! VGT 1052 

PCCPLSPPASPSPSVSSTTSQSSTFSSQAPDPKVTSMFELSGPFRQQHFLAGI LLTELAL 1119 
PIRSANIPDPLTP SES -TQELHASDMPEYSVTNEFCRKHFLIGl LLRE VGF 1157 



ALQDN YE I RYTAI SV I KNLLI KHAFDTRYQHKNQQAKI AQLYI,PFVGLLLEN; 

ALDAEGEGIS^ VQRKAVSAIHSLLSSHDLDFRCVKPEVKVKIAALYLPLVGIILDALt 
ILDPDAEGLFG UiKKVINMVHNLLSSHDSDPRYSDPQIKARVAMLYLPLIGIIMETV P- 



I2RL 



1116 
- 1143 
~ 1254 

ALQEFR----ElVRLIAISVlJCNLLIKHSFDDRYASRSHGARmTLYLPLFGLLIE 1108 

— 1177 
1213 



alepeiaegafiJlhkkai savhsllcghdtdpryaeatvkarvaelylplls iardtiJp 

ALOEDQ : 



E ^mHIJUuAVLKNI2^AKHSFDDRYREPRKQAQIASLYMPLYG^aiLDNM PRI 



hCLASP4 
hCLASP5 
hCLASP3 
hCLASP2 
hCLASP7 
hSOLASPl 



AGRDTLYSCA AMPN-S ASRDEFPCGFTSPANRGSLSTDKDTAYGS 1160 

CDFTVADTRRYRTSGSD 1162 

QX.Y DFTETHNQRGRPICIATDD — 1276 

NVRDVSPFPVNAGMTNnKDESIiALPA-VNPLVTPQKGSTLDNSLHKDIJiG^ 1167 

RLH DFAEGPGQRSRLASMLDSDTE 1201 

YLKDLYPFTVNTSNQGSRDDLSTNGGFQSQTAIKHANSVDTSFSKDVLNSIAAFSSIAIS 1273 



hgLASP4 
hfjLASPS 

3$KlASP7 
H^sy^pi 



FQ-NGHGIKREDSRGSLIPEGATGFPDQGNTGEN TRQSSTRSSVSQYNRLDQYE 1213 

EEQEGAGAINQNVALAIAGNNFNLKT SGIVLSSLPYKQYNMLNADT 1208 

YESESGSMISQTVAMAIAGTSVPQLTR PGSFLLTSTSGRQHTTFSAES 1324 

STPNINSVRNADSRGSLISTDSGNSLPERNSEKSNSLDKHQQSSTLGNSWRCDKLDQSE 1227 

GEGDIAGTINPSVAMAIAGGPLAPGSR ASISQGPPTASRAGCALSAES 124 9 

TVNHADSRASLASLDSNPSTNEKSSEKTDNCEKIPRPLALIGSTLRFDRLDQAE 1327 



hClLASP4 
h#LASP5 
n.®iASP3 
hgLASP2 
h&ASP7 
htLASPl 



] RSLLMCYLYIVKMISEDTLLTYWNKVSPQELINILILLEVCLFHFRYMGKRNIAR WDA 1273 
t|rN124ICFLWIMKNADQSLIRKW1ADLPSTQI^ILDLLFICVLCFEYKGKQSS /STQ 1268 
RSLLICLLWVLKNADETVLQKWFTDLSVLQLNRLLDLLYLCVSCFEYKGKKVFER 4NSL 1384 
]1KSLLMCFLYILKSMSDDALFTYWNKASTSED4DFFTISEVCLHQFQYMGKRYIAR?JQE^ 1287 
£ RTLIACVLWVLKNTEPALLQRWATDLTLPQLGRLLDLLYLCLAAFEYKGKKAFER INSL 1309 
':jRSLLMCFLHIMKTISYETLIAYWQRAPSPEVSDFFSILDVCLQNFRYLGKRNIIRKIAA 1387 



hCLASP4 
hCLASPS 
hCIASP3 
hCLASP2 
hCLASP7 
hCLASPl 



WLSKHFGIDR— 
VLQKSRDVKAR- 
TFKKSKDMRAK- 
LGPIVHDRKS— 
TFKKSLDMKAR- 



— KSQTMPALRNRSGVMQARLQHLSSLESS 1311 

'LEEALLRGEGARGEMMRRRAPGNDRFPGLNEN 1311 

-LEEAILGSIGARQEMVRRSRGQLERSPSGSAFGSQ 1430 

QTLPVSRNRTQdMHARLQQLGSLDNS 1323 

'LEEAILGTIGARQEMVRRSRERSPFC24PEN^ 1350 

- 1442 



AFKFVQSTQNNGTLKGSNPSCQTSGLLAQWMHSTSRHEGHKQHRSQTLPIIRGKN- 



hCLASP4 
hCLASPS 
hCLASP3 
hCLASP2 
hCLASP7 
hCLASPl 



— ALSNPKLLQWLDNTMTSNSNE ] 



FTLNHSSTTTEiJDIFHQALLEGNTATEVSLTVLDTISFFrOCFKTQLL 1359 
-'LRWKKEQTHWRQANEKLDKTKELDQEALISGNLATEAHLIILrHQENI EQASS-ALD 1368 
ENLRWRKmTHWRQNTEKLDKSR? EIEHEALIDGNLATEANLIILDTLEIV /QTVS-VTE 1489 

LTFNHSYGHSD; DVLHQSLLEAMIATEVCLTALDTLSLF TLAFKNQLL 1371 

- -VRWRKSVTHWKQTSDRVDKTKi}eMEHEALVEGNLATEASLVVIiDTLE I ij^QTVM-LSE 1407 

1500 



DIVHHVDTEANLATEGCLTILDLVSLF rQTHQRQLQ 
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hCLASP4 
hCLASPS 
hCLASP3 
hCLASP2 
hCLASP? 
hCLASPl 



NNIXSHNPLMKKVFDIHIAFIiOIGQSEVSLKHVFASLRAFISK^^ 1419 

CKDS LLGGVLRVLVNSLNCDQSTTYLTHCFATLRALIAKFGDLLFEEEVEQCFDLCH 1425 

SKES ILGGVUCVLLHSMACNQSAWLQHCFATQRALVSKFPEUiFEEETEQCADLCL 154 6 

ADHGHNPIiiKK\TOVYLCFLQKHQSETALKWFTAIJl^ 1431 

ARES VLGAVLKWLYSLGSAQSALFLQHGLATQRALVSKFPELLFEEDTEIiCADLCL 1464 

QCDCQNS IMa^Gn)TYMLFFQVNQS ATALKHVFASLRLFVCKFPSAFFQGPADLCGS FCY 1560 



hCIASP4 
hCLASPS 
hCLASPB 
hCLASP2 
hCLASP7 
hCLASPl 



EVLKCCTSKISSTRNEASALLYLIMOWFEYTKRKTFLRTHLQIIIAVSQLIADVALSGG 1479 
QVLHHCSSSMDVTRSQACATLYLIMR — FSFGATSNFARVKMQVTMSLASLVGRAPDFNE 1483 
RLLRHCSSSIGTIRSHPSASLYLIJ4R---QNFEIGlWFARVKMQVPMSLSSLVGTSQNFra 1604 
EILKCCNSKLSSIRTEASQLLYFIJ4RNNF|)YTGKKSFTOTHLQVI ISVSQLIADWG • 14 91-- 
RLLRHCGSRISTIRTHASASLYLLMR — QNFEIGHNFARVKMQVTMSLSSLVGTTQNFSE 1522 
EVLKCCNHRSRSTQTEASALLYLEMRKNFEFrn<QKSIVRSHLQLIKAVSQLIABAG-IOT 1 61 9 



hCLASP4 
hCIASPS 
hCLASP3 
hCLASP2 
h(^lASP7 

K^Slaspi 



hg:jvsP4 
rieiiASPS 

h&ASP3 

hSu^P2 

hOLASPT 
hCLASPl 



hCLASP4 
hCLASPS 
lv(tASP3 
h-(iLASP2 
HoLASP7 
hCLASPl 



SRFQESLFI INNFANSDRPMKATAFPAEVKDLTKRIRTVimTAQg«KEHEKDPEMLIDLQ 

EHI4^RSLRTILAYSEEDTAMGWTPFPTQVEELLCNLNSILYDTVKMREFQEDPEMIM5LM 

EFLimSLKTILTYAEEDLELRETTFPIXJVQDLVFOTilMILSDTVKMKEHQEDPE^ 

TRFQQSLSIimCAKSDRLIKHTSFSSDVKDLTKRIRTVLMATAQWKEHENDP^^ 

EHLRRSLKTILTYAEEmGLRDSTFAEQVQDLMFNLHMILTDTVKMKEHQEDPEMLIDLM 

SRFQHSLAITNNFANGDKQMKNSNFPAEVKDLTKRIRTVLbdATAQHKEHEKDPEMLTO^ 

* • * * • * * • -k • it * ^ * > -k-k it -k-k * ^ -k 

transmembrane 

YS LAKS YAS TPELRKTWLDSMAK I HVKNGljFSEAAMCYVHVAALVAEFljHRKK 

YRIAKS YQASPDLRLTWLQNMAEKHTKKKC YTEAAMCLVHAAALV?£YI SMLEDK 

YRIAKGYQTSPE-RLTWLQNMAGKHSERS> HAEAAQCLVHSAALVAEYI SMLEDR 

YSLAKSYASTPELRKTWLDSMARIHVKNGI LSEAAMCYVHVTALVAEYI TRKG 

YRIARGYQGSPt)LRLTWLQNMAGKHAELG>HAEAAQCMVHAAALVAEYI ALLEDQ 

YSLANSYASTPELRRTWLESMAKIHARNGI LSEAAMCYIHIAALIAEYI ECRKGYWKVEKI 



, * M k kkik • kk it 



titk-k k • -k < ** • *k , 



1539 
1543 
1664 
1551 
1582 
1679 



1592 
1598 
1718 
1604 
1637 
1739 



LFPNGCSAFKKITPNIBEEGAMKEDAGMMD 1622 

S YLPVGS VS FQN I S SNVLEESWSEDTLSPDEDGV 1633 

KYLPVGCVTFQNISSNVLEESAVSDDWSPDEEGI 1753 

VFRQGCTAFRVITPNIDEEASMMEDVGMQD 1634 

RHLPVGCVS FQNI S SNVLEESAI S DDI LS PDEEGF 1672 

CTASLLSEDTHPCDSNSLLTTPSGGSMFSMGWPAFLSITPNIKEEGAAKEDSGMHD 1795 



ITAM 



.hCLASP4 
hCLASPS 
hCLASP3 
hCLASP2 
hCLASP7 
hCLASPl 



hCLASP4 
hCLASPS 
hCLASP3 
hCLASP2 
hCLASP7 
hCLASPl 



VHYSEEVLLELLEQCVDGLWKAERYEIISEISKLIVPIYEKRREFEKLTQV YRTI HG 1679 

CAGQYFTESGLVGLLEQAAELFSTGGLYETWEVYKLVIPILEAHREFRKLTLT HSKI QR 1693 
CS GKYFTESGLVGLLEQAAAS FSMAQIYEAVNEVYKVLI P I HEANRDAKKLST I HGKI QE 1813 

VHFNEDVIMILLEQCADGLWKAERYELIADIYKLIIPIYEKRR 1677 

CSGKHFTELGLVGLLEQAAGYFTMGGLYEAVNEVYKNLIPILEAHRDYKKIA^^ HGKI QE 17 32 
TPYNENILVEQLYKCGEFLWKSERYELLADVNKPI lAVFEKQRDFKKLSDI YYDI HR 1852 



ITAM 



DOCK motif 



DOCK motif 



YTK3 LEVMHTKKRLLGIFFRAJ AF YGQS FFEEEI GKE^ I YKEEKLTGLSEISLRLVKI YG 
I FDSl VNKDH— KRMFG1 YFR\ GF FG-SKFGDLEEQEi V YKEI AITKLPEISHRLEA1 YG 
I FSKl VHQSTGWERMFG1 YFR\) GB YG- 1 KFGDLI EQE I V YKEI AI TKLAE ISHRLEGI YG 

£ FFEDEE GKE^ I YKEI KLTPLSEISQRLLKI YS 

? FTKl MHQS SGWERVPGl YFRV GF YG-i^ HFGDLI EQEE V YKEI SITKLAEI SHRLEEl YT 
< YLK\ AEWNSEKRLFGB YYR\?AFYGQGFFEEEEGKE1i I YKEEKLTGLSEISQRLLK3 YAj 1912 



ITAM 
1739 
1750 
1872 
1710 
1791 
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TTAM 



hCLASP4 
hCLASPS 
hCLASP3 
hCLASP2 
hCLASP7 
hCLASPl 



hCLASP4 
hCLASPS 
hCLASPS 
hCLASP2 
hCIASP7 
hCLASPl 



hCIiASP4 
hCLASPS 
hCIASP3 
hCLASP2 
hCLASP? 
hGiASPl 



hc6ASP4 
hCjASPS 
hC3^SP3 

hciASPa 

hGMASP7 
hdSASPl 



hCpASP4 
hGLASPS 
hG;*ASP3 
hC:£ASP2 
hdgASPT 
hdlASPl 



mm 



EK FGTENVKIIQDSDKVNAKELDPi YAHIQ\ T rVKI YFDDKELTERKTEFERNHNISRFV 
QCFGAEFVEVIKDSTPVDKTKI.t>P>KAYIQ3 TFVEI YFDEYEMKDRVTYFEKNFNLRREM 
EP FGEDWEVIKDSNPVDKCKLDPl^KAYIQl T fVEI YFDTYE^^KDRITYFDKNYNLRRn^ 
DK FGSENVKMIQDSGKVNPKDLDSI YAYIQ\ 1 tJVII FFDEKELQERKTEFERSHNIRRra 
EF FGDDWEI IKDSYPVDKSKLDSC KAYIQ] 1 rVEI YFDTYELKDRVTYFDRNYGLRTFL 
DK FGADNVKIIQDSNKVNPKDLDPJ YAYIQ\ T rVTI FFEEKEIEDRKTDFEMHHNINRFV 



ITAM 



FEAPYTLSGKKQGCIEEQCKRRTILTTSNSFP mCf RIPINCEQQIt LKPIDGATDEIKD 
YTTPFTLEGRPRGELHEQYRRNTVLTTMHAFF "ilKl RISVIQKEEF\ LTPIEVAIEDMKK 
YCTPFTLDGRAHGELHEQFKRKTILTTSHAFF SflKT RVNVTHKEEII LTPIEVAIEDMQK 
FEMPFTQTGKRQGGVEEQCKRRTILTAIHCFP ma RIPVMYQHHTI LNPIEVAIDEMSK 
FCTPFTPDGRAHGELPEQHKRKTLLSTDHAFF yiKI RIRVCHREE'H LTPVEVAIEEMQK 
FETPFTLSGKKHGGVAEQCKRRTILTTSHLFF YVKI RIQVISQSSTI LNPIEVAIDEMSR 



Coiled-coil 



1799 
1810- 
1932 
1770 
1851 
1972 



DOCK motif 



1859 
1870 
1992 
1830 
1911 
2032 



KTAELQKLCS S TDVDMI QLQLKLQC WVSVQVNAGPLAYARAFLNDSQASKYPPKKVSELK 



KTLQIAVAINQEPPDAKMLCMVLQC 
KTQElAFATHQDPABPKMLC^miQC 
KVAELRQLCSSAEVmiKLQLKLQC 



SVGATVNQGPLEVAQVFLT^IPADPKLYRHHNKLR 
S VGTTVNQGPLEVAQVFLSEI PSDPKLFRHHNKLR 
SVSVQVNAGPLAYARAFLDDTNTKRYPDNKVKLLK 
KTRELAFATEQDPPDAKMLQMVLQC SVGPTVNQGPLEVAQVFLAE I PEDPKLFRHfiNKLR 
KVSELNQLCTMEEVDMISLQLKLQC SVSVKVNAGPMAYARAFLEETNAKKYPDNQVKLLK 



EI 



Coiled-coil 



DMFRKFIQACS : ALELNERIilKEDQVEYHEGLKSNFREMVKELS DlllHEQILQEDT^ 
LCFKEFI^mCGIAVEKNKRLITAIX5REYQQELKKNY^^CLKENLRP^lIERKIPEL^^^ 

LCFKDFTKRCEI )ALRKNKSLIGPVQKEYQRELGKLSSP 

EVFRQF\^CG(:!AIAVNERLIKEr>QLEYQEEMKA[ra<EMAKELSE^^ 
LCFKDFCKKCEI >ALRKNKALIGPDQKEYHRELERNYCRLREALQPI LTQRLPQLMAPTP- 



FRQFADACG^ALDVNERLIKEDQLEYQEELRSHYKDMLSELST\f^NEQITGRDDLSKR 



PD Z liqa nd 
WMSNTLHVFCAISGTSSDRGYGSP HY^yi — 2008 
VESQKRDSFHRSSFRKCETQLSQGS 2015 

VLPNSLHIFNAISGTPTSTMVHGHTS dsSWi 1980 

— PGLRNSLNRASFRKADL ^ ~ 2047 

GVDQTCTRVISKATPALPTVSISS ^AEV^ — 2180 



1919 
1930 
2052 
1890 
1971 
2092 



1979 
1990 
2090 
1949 
2030 
2152 
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